{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# HTML解析入门及准备URL生成连续技\n",
    "![for humans](https://requests-html.kennethreitz.org/_static/requests-html-logo.png#thumbnail)\n",
    "\n",
    "*  本周主要内容：HTML解析（parse HTML）及准备URL生成连续技\n",
    "*  上周主要内容：HTML解析（parse HTML）及Xpath实践\n",
    "*  20春_Web数据挖掘_week03\n",
    "*  电子讲义设计者：廖汉腾, 许智超\n",
    "<br/>\n",
    "<br/>\n",
    "\n",
    "-----\n",
    "## 复习\n",
    "\n",
    "复习：上周内容，实践\n",
    "\n",
    "* HTML解析（parse HTML）: requests-html  丶\n",
    "* Xpath实践\n",
    "* m.liepin.com 取工作牛肉\n",
    "\n",
    "-----\n",
    "## 本周内容及学习目标\n",
    "\n",
    "本周内容聚焦在\n",
    "\n",
    "<mark> 如何从一页开始有系统的找更多页的内容 </mark>\n",
    "\n",
    "为此，我们需要学习\n",
    "\n",
    "1. 拆解带有参数的URL，并再从query取出参数\n",
    "   a. URL拆解: 使用 urllib.parse 解析 出query\n",
    "   b. query拆解:  取出参数 成python字典\n",
    "2. 有基底URL，加上参数字典，请求新网页连续技\n",
    "\n",
    "我们除了继续学习解决上一周已开始面对的以下挑战：\n",
    "![Xpath Axis](http://krum.rz.uni-mannheim.de/inet-2005/images/xpath-axis.gif)\n",
    "\n",
    "### 旧目标\n",
    "1. 使用 requests-html 爬取并存取网页文字档，查找[requests-html 中文文档](https://cncert.github.io/requests-html-doc-cn/#/)\n",
    "2. 熟悉 [xpath 语法](https://www.w3cschool.cn/xpath/xpath-syntax.html)丶[xpath 节点](https://www.w3cschool.cn/xpath/xpath-nodes.html)\n",
    "3. 使用 [xpath cheatsheet](https://devhints.io/xpath)\n",
    "  * 在 Chrome Inspector 使用\n",
    "  * 在 requests-html (Python) 使用\n",
    "4. 简易使用 [pd.DataFrame](https://www.pypandas.cn/doc/getting_started/dsintro.html#dataframe)\n",
    "\n",
    "### 新目标\n",
    "这一周，学生将实践\n",
    "* 猎聘PC版 liepin.com 取工作URL参数的牛肉\n",
    "* 如何生成一连串新URL以进一步爬取数据\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<style>\n",
       "/* 本电子讲义使用之CSS */\n",
       "div.code_cell {\n",
       "    background-color: #e5f1fe;\n",
       "}\n",
       "div.cell.selected {\n",
       "    background-color: #effee2;\n",
       "    font-size: 2rem;\n",
       "    line-height: 2.4rem;\n",
       "}\n",
       "div.cell.selected .rendered_html table {\n",
       "    font-size: 2rem !important;\n",
       "    line-height: 2.4rem !important;\n",
       "}\n",
       ".rendered_html pre code {\n",
       "    background-color: #C4E4ff;   \n",
       "    padding: 2px 25px;\n",
       "}\n",
       ".rendered_html pre {\n",
       "    background-color: #99c9ff;\n",
       "}\n",
       "div.code_cell .CodeMirror {\n",
       "    font-size: 2rem !important;\n",
       "    line-height: 2.4rem !important;\n",
       "}\n",
       ".rendered_html img, .rendered_html svg {\n",
       "    max-width: 60%;\n",
       "    height: auto;\n",
       "    float: right;\n",
       "}\n",
       "\n",
       ".rendered_html img[src*=\"#full\"], .rendered_html svg[src*=\"#full\"] {\n",
       "    max-width: 100%;\n",
       "    height: auto;\n",
       "    float: none;\n",
       "}\n",
       "\n",
       ".rendered_html img[src*=\"#thumbnail\"], .rendered_html svg[src*=\"#thumbnail\"] {\n",
       "    max-width: 15%;\n",
       "    height: auto;\n",
       "}\n",
       "\n",
       "/* Gradient transparent - color - transparent */\n",
       "hr {\n",
       "    border: 0;\n",
       "    border-bottom: 1px dashed #ccc;\n",
       "}\n",
       ".emoticon{\n",
       "    font-size: 5rem;\n",
       "    line-height: 4.4rem;\n",
       "    text-align: center;\n",
       "    vertical-align: middle;\n",
       "}\n",
       ".bg-split_apply_comine {\n",
       "    width: 500px;     \n",
       "    height: 300px;\n",
       "    background: url('02_split-apply-comine_500x300.png') -10px -10px;\n",
       "    float: right;\n",
       "}\n",
       ".bg-comine {\n",
       "    width: 175px;\n",
       "    height: 150px;\n",
       "    background: url('02_split-apply-comine_500x300.png') -280px -80px;\n",
       "    float: right;\n",
       "}\n",
       ".bg-apply {\n",
       "    width: 155px;\n",
       "    height: 225px;\n",
       "    background: url('02_split-apply-comine_500x300.png') -160px -30px;\n",
       "    float: right;\n",
       "}\n",
       ".bg-split {\n",
       "    width: 205px;\n",
       "    height: 225px;\n",
       "    background: url('02_split-apply-comine_500x300.png') -10px -30px;\n",
       "    float: right;\n",
       "}\n",
       ".break {\n",
       "                   page-break-after: right; \n",
       "                   width:700px;\n",
       "                   clear:both;\n",
       "}\n",
       "</style>\n"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "%%html\n",
    "<style>\n",
    "/* 本电子讲义使用之CSS */\n",
    "div.code_cell {\n",
    "    background-color: #e5f1fe;\n",
    "}\n",
    "div.cell.selected {\n",
    "    background-color: #effee2;\n",
    "    font-size: 2rem;\n",
    "    line-height: 2.4rem;\n",
    "}\n",
    "div.cell.selected .rendered_html table {\n",
    "    font-size: 2rem !important;\n",
    "    line-height: 2.4rem !important;\n",
    "}\n",
    ".rendered_html pre code {\n",
    "    background-color: #C4E4ff;   \n",
    "    padding: 2px 25px;\n",
    "}\n",
    ".rendered_html pre {\n",
    "    background-color: #99c9ff;\n",
    "}\n",
    "div.code_cell .CodeMirror {\n",
    "    font-size: 2rem !important;\n",
    "    line-height: 2.4rem !important;\n",
    "}\n",
    ".rendered_html img, .rendered_html svg {\n",
    "    max-width: 60%;\n",
    "    height: auto;\n",
    "    float: right;\n",
    "}\n",
    "\n",
    ".rendered_html img[src*=\"#full\"], .rendered_html svg[src*=\"#full\"] {\n",
    "    max-width: 100%;\n",
    "    height: auto;\n",
    "    float: none;\n",
    "}\n",
    "\n",
    ".rendered_html img[src*=\"#thumbnail\"], .rendered_html svg[src*=\"#thumbnail\"] {\n",
    "    max-width: 15%;\n",
    "    height: auto;\n",
    "}\n",
    "\n",
    "/* Gradient transparent - color - transparent */\n",
    "hr {\n",
    "    border: 0;\n",
    "    border-bottom: 1px dashed #ccc;\n",
    "}\n",
    ".emoticon{\n",
    "    font-size: 5rem;\n",
    "    line-height: 4.4rem;\n",
    "    text-align: center;\n",
    "    vertical-align: middle;\n",
    "}\n",
    ".bg-split_apply_comine {\n",
    "    width: 500px;     \n",
    "    height: 300px;\n",
    "    background: url('02_split-apply-comine_500x300.png') -10px -10px;\n",
    "    float: right;\n",
    "}\n",
    ".bg-comine {\n",
    "    width: 175px;\n",
    "    height: 150px;\n",
    "    background: url('02_split-apply-comine_500x300.png') -280px -80px;\n",
    "    float: right;\n",
    "}\n",
    ".bg-apply {\n",
    "    width: 155px;\n",
    "    height: 225px;\n",
    "    background: url('02_split-apply-comine_500x300.png') -160px -30px;\n",
    "    float: right;\n",
    "}\n",
    ".bg-split {\n",
    "    width: 205px;\n",
    "    height: 225px;\n",
    "    background: url('02_split-apply-comine_500x300.png') -10px -30px;\n",
    "    float: right;\n",
    "}\n",
    ".break {\n",
    "                   page-break-after: right; \n",
    "                   width:700px;\n",
    "                   clear:both;\n",
    "}\n",
    "</style>"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [],
   "source": [
    "# 基本模块\n",
    "import pandas as pd\n",
    "from requests_html import HTMLSession"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# 0. 上周加分作业解答"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {
    "scrolled": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "[60, 60, 60, 60, 60, 60, 60, 60]\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>职称</th>\n",
       "      <th>薪水</th>\n",
       "      <th>公司地点</th>\n",
       "      <th>公司名称</th>\n",
       "      <th>时间</th>\n",
       "      <th>经验</th>\n",
       "      <th>链结</th>\n",
       "      <th>公司URL</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <td>0</td>\n",
       "      <td>教育科技 软件产品经理</td>\n",
       "      <td>12-18k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>融捷投资控股集团</td>\n",
       "      <td>22小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1922705123.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8025674/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>1</td>\n",
       "      <td>旅游产品经理</td>\n",
       "      <td>12-20k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>前海爱讯科技(深圳)有限公司</td>\n",
       "      <td>前天</td>\n",
       "      <td>2年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1926703515.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8972310/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>2</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>12-18k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>广州大白互联网科技有限公司</td>\n",
       "      <td>2020-04-02</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1922402715.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8695948/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>3</td>\n",
       "      <td>实施经理</td>\n",
       "      <td>16-23k·12薪</td>\n",
       "      <td>广州-大沙</td>\n",
       "      <td>广东卓志供应链服务集团有限公司</td>\n",
       "      <td>2020-04-01</td>\n",
       "      <td>5年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1924985573.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9238204/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>4</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>20-25k·12薪</td>\n",
       "      <td>广州-番禺区</td>\n",
       "      <td>湖南狐轩服饰有限公司</td>\n",
       "      <td>2020-03-31</td>\n",
       "      <td>8年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927101491.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10202635/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>5</td>\n",
       "      <td>互联网产品经理</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州-琶洲</td>\n",
       "      <td>广东车海洋环保科技有限公司</td>\n",
       "      <td>2020-03-20</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1917453193.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9256869/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>6</td>\n",
       "      <td>后台产品经理</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广东南方新媒体股份有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925126353.shtml</td>\n",
       "      <td>https://m.liepin.com/company/7889168/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>7</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>20-25k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>某软件开发企业</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/a/18948933.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>8</td>\n",
       "      <td>产品经理（电商系统）</td>\n",
       "      <td>25-40k·14薪</td>\n",
       "      <td>广东,深圳,广州</td>\n",
       "      <td>知名跨境电商公司</td>\n",
       "      <td>1小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/18705133.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>9</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>18-30k·16薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>某大型百强置业有限公司</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19609335.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>10</td>\n",
       "      <td>ai产品经理</td>\n",
       "      <td>20-40k·13薪</td>\n",
       "      <td>北京,上海,广州</td>\n",
       "      <td>某互联网大数据云计算上市公司</td>\n",
       "      <td>2020-04-01</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19603323.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>11</td>\n",
       "      <td>WMS产品经理</td>\n",
       "      <td>20-35k·14薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>某知名跨境电商平台</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>2年以上 学历不限</td>\n",
       "      <td>https://m.liepin.com/a/18963147.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>12</td>\n",
       "      <td>区块链产品经理</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州-黄埔区</td>\n",
       "      <td>北京普瑞未来教育科技集团有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1919835727.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9989029/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>13</td>\n",
       "      <td>产品经理（支付/后端）</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>北京路客互联网科技有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1917750895.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9284656/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>14</td>\n",
       "      <td>产品总监</td>\n",
       "      <td>50-70k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>名创优品</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>8年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1925389277.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8392675/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>15</td>\n",
       "      <td>产品专员</td>\n",
       "      <td>5-8k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>广州三易互联网科技有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>经验不限 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1922364281.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9647941/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>16</td>\n",
       "      <td>产品助理</td>\n",
       "      <td>5-8k·13薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>广州三易互联网科技有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>经验不限 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1922356557.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9647941/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>17</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州易达建信科技开发有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1919464529.shtml</td>\n",
       "      <td>https://m.liepin.com/company/5493174/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>18</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>14-22k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>锦江信息技术(广州)有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1919024715.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8973053/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>19</td>\n",
       "      <td>供应链产品经理</td>\n",
       "      <td>10-23k·12薪</td>\n",
       "      <td>广州-黄埔区</td>\n",
       "      <td>健客网</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1914662183.shtml</td>\n",
       "      <td>https://m.liepin.com/company/582047/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>20</td>\n",
       "      <td>市场产品经理/主管</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>北京仲惠信息咨询有限公司</td>\n",
       "      <td>1小时前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927235433.shtml</td>\n",
       "      <td>https://m.liepin.com/company/12198453/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>21</td>\n",
       "      <td>高级金融产品经理</td>\n",
       "      <td>20-35k·15薪</td>\n",
       "      <td>广州-南沙区</td>\n",
       "      <td>暗物智能科技(广州)有限公司</td>\n",
       "      <td>2小时前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926378687.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9671912/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>22</td>\n",
       "      <td>新媒体运营</td>\n",
       "      <td>8-15k·14薪</td>\n",
       "      <td>广州-花都区</td>\n",
       "      <td>联友科技</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>1年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925693659.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1680117/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>23</td>\n",
       "      <td>内衣项目总经理 - 跨境电商出口行业</td>\n",
       "      <td>20-25k·12薪</td>\n",
       "      <td>广州-越秀区</td>\n",
       "      <td>文峰荟(广州)创业投资管理有限公司</td>\n",
       "      <td>59分钟前</td>\n",
       "      <td>10年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1920988285.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9412628/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>24</td>\n",
       "      <td>产品顾问（多媒体方向）</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州-花都区</td>\n",
       "      <td>联友科技</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920316169.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1680117/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>25</td>\n",
       "      <td>产品顾问（地图导航方向）</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州-花都区</td>\n",
       "      <td>联友科技</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920316167.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1680117/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>26</td>\n",
       "      <td>产品顾问（车载语音方向）</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州-花都区</td>\n",
       "      <td>联友科技</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920316163.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1680117/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>27</td>\n",
       "      <td>产品经理（OA）</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>佛山市艾臣家居科技有限公司</td>\n",
       "      <td>2小时前</td>\n",
       "      <td>3年以上 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1919955237.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9220328/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>28</td>\n",
       "      <td>APP产品经理</td>\n",
       "      <td>2-3k·13薪</td>\n",
       "      <td>广州-荔湾区</td>\n",
       "      <td>某科技公司招聘APP产品经理</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>5年以上 学历不限</td>\n",
       "      <td>https://m.liepin.com/a/19595503.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>29</td>\n",
       "      <td>APP产品经理（知识付费 在线教育）</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州-白云区</td>\n",
       "      <td>YY- 在线教育知识付费广州某公司</td>\n",
       "      <td>4小时前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/19570073.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>30</td>\n",
       "      <td>产品总监</td>\n",
       "      <td>80-100k·13薪</td>\n",
       "      <td>北京-东城区,广州-海珠区,上海-徐汇区</td>\n",
       "      <td>某科技公司招聘产品总监</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>5年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19420579.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>31</td>\n",
       "      <td>相机产品经理</td>\n",
       "      <td>25-45k·16薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>某知名游戏公司</td>\n",
       "      <td>1小时前</td>\n",
       "      <td>5年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/18834745.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>32</td>\n",
       "      <td>商家履约产品经理</td>\n",
       "      <td>25-40k·14薪</td>\n",
       "      <td>广州,杭州</td>\n",
       "      <td>瀚特英咨询</td>\n",
       "      <td>1小时前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/18744551.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>33</td>\n",
       "      <td>直播产品运营经理</td>\n",
       "      <td>15-25k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>香港某传播有限公司</td>\n",
       "      <td>1小时前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/18725599.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>34</td>\n",
       "      <td>直播产品经理</td>\n",
       "      <td>20-25k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>香港某传播有限公司</td>\n",
       "      <td>1小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/18562407.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>35</td>\n",
       "      <td>融捷健康智能电子公司-智能硬件产品经理</td>\n",
       "      <td>面议</td>\n",
       "      <td>广州</td>\n",
       "      <td>融捷投资控股集团</td>\n",
       "      <td>22小时前</td>\n",
       "      <td>5年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1926705941.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8025674/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>36</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>20-40k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>卓尔人人</td>\n",
       "      <td>15小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1926419233.shtml</td>\n",
       "      <td>https://m.liepin.com/company/12146335/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>37</td>\n",
       "      <td>App产品经理</td>\n",
       "      <td>10-15k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>某科技有限公司</td>\n",
       "      <td>18小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19594403.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>38</td>\n",
       "      <td>APP产品经理（知识付费/在线教育）</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>知识付费在线教育APP</td>\n",
       "      <td>昨天</td>\n",
       "      <td>2年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19548311.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>39</td>\n",
       "      <td>APP产品经理（知识付费/在线教育）</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>知识付费在线教育APP</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/19524533.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>40</td>\n",
       "      <td>APP产品经理（知识付费/在线教育）</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>知识付费在线教育APP</td>\n",
       "      <td>19小时前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/19483901.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>41</td>\n",
       "      <td>数据产品</td>\n",
       "      <td>35-65k·16薪</td>\n",
       "      <td>北京,上海,广州</td>\n",
       "      <td>某互联网企业</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19465809.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>42</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>18-30k·14薪</td>\n",
       "      <td>北京,广州,杭州</td>\n",
       "      <td>某大型互联网公司</td>\n",
       "      <td>18小时前</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/a/19415683.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>43</td>\n",
       "      <td>商家履约产品经理</td>\n",
       "      <td>25-50k·18薪</td>\n",
       "      <td>广东,广州,杭州</td>\n",
       "      <td>某大型知名跨境电商平台准独角兽（C+轮）</td>\n",
       "      <td>21小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19371747.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>44</td>\n",
       "      <td>WMS产品经理</td>\n",
       "      <td>25-50k·18薪</td>\n",
       "      <td>广东,广州,杭州</td>\n",
       "      <td>某大型知名跨境电商平台准独角兽（C+轮）</td>\n",
       "      <td>21小时前</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19370965.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>45</td>\n",
       "      <td>WMS产品经理</td>\n",
       "      <td>25-40k·14薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>hunter-in</td>\n",
       "      <td>19小时前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/18744427.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>46</td>\n",
       "      <td>商家履约产品经理</td>\n",
       "      <td>25-40k·14薪</td>\n",
       "      <td>广州,浙江</td>\n",
       "      <td>hunter-in</td>\n",
       "      <td>19小时前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/18744381.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>47</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>12-24k·13薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>360</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927232311.shtml</td>\n",
       "      <td>https://m.liepin.com/company/6429309/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>48</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>15-30k·14薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>信用生活(广州)智能科技有限公司</td>\n",
       "      <td>前天</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927228991.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9512616/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>49</td>\n",
       "      <td>资深产品经理（相机产品）</td>\n",
       "      <td>面议</td>\n",
       "      <td>广州</td>\n",
       "      <td>网易集团</td>\n",
       "      <td>前天</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926534703.shtml</td>\n",
       "      <td>https://m.liepin.com/company/5964833/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>50</td>\n",
       "      <td>产品经理（用户体验改善）</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>嘟比英语</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1926412121.shtml</td>\n",
       "      <td>https://m.liepin.com/company/12166375/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>51</td>\n",
       "      <td>APP产品经理</td>\n",
       "      <td>10-15k·13薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>APP产品经理</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/a/19601195.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>52</td>\n",
       "      <td>APP产品经理(知识付费/在线教育)</td>\n",
       "      <td>10-12k·13薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>互联网公司</td>\n",
       "      <td>昨天</td>\n",
       "      <td>经验不限 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/19595423.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>53</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>20-30k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>中港合资传媒</td>\n",
       "      <td>昨天</td>\n",
       "      <td>5年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/a/18725697.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>54</td>\n",
       "      <td>直播产品运营经理</td>\n",
       "      <td>20-30k·13薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>中港合资传媒</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/a/18711143.shtml</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>55</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>9-13k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>广州共享出行科技有限公司</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927204563.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10118191/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>56</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>15-30k·12薪</td>\n",
       "      <td>广州-车陂</td>\n",
       "      <td>信用生活(广州)智能科技有限公司</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>3年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1927200673.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9512616/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>57</td>\n",
       "      <td>国际支付产品专家</td>\n",
       "      <td>面议</td>\n",
       "      <td>广州-珠江新城</td>\n",
       "      <td>高盛国际人事顾问有限公司</td>\n",
       "      <td>前天</td>\n",
       "      <td>经验不限 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927186127.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9766571/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>58</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>青木数字技术股份有限公司</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>5年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1927082439.shtml</td>\n",
       "      <td>https://m.liepin.com/company/12191983/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>59</td>\n",
       "      <td>产品经理（校园招聘）</td>\n",
       "      <td>8-12k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>佳都新太科技</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>经验不限 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1927075137.shtml</td>\n",
       "      <td>https://m.liepin.com/company/2115085/</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                      职称           薪水                  公司地点  \\\n",
       "0           教育科技 软件产品经理    12-18k·12薪                    广州   \n",
       "1                旅游产品经理    12-20k·12薪                    广州   \n",
       "2                  产品经理    12-18k·12薪                广州-海珠区   \n",
       "3                  实施经理    16-23k·12薪                 广州-大沙   \n",
       "4                  产品经理    20-25k·12薪                广州-番禺区   \n",
       "5               互联网产品经理    10-15k·12薪                 广州-琶洲   \n",
       "6                后台产品经理    10-20k·12薪                    广州   \n",
       "7                高级产品经理    20-25k·13薪                    广州   \n",
       "8            产品经理（电商系统）    25-40k·14薪              广东,深圳,广州   \n",
       "9                  产品经理    18-30k·16薪                    广州   \n",
       "10               ai产品经理    20-40k·13薪              北京,上海,广州   \n",
       "11              WMS产品经理    20-35k·14薪                    广州   \n",
       "12              区块链产品经理    15-25k·12薪                广州-黄埔区   \n",
       "13          产品经理（支付/后端）    10-20k·12薪                广州-海珠区   \n",
       "14                 产品总监    50-70k·13薪                    广州   \n",
       "15                 产品专员      5-8k·12薪                广州-海珠区   \n",
       "16                 产品助理      5-8k·13薪                广州-海珠区   \n",
       "17                 产品经理    10-20k·12薪                广州-天河区   \n",
       "18                 产品经理    14-22k·12薪                    广州   \n",
       "19              供应链产品经理    10-23k·12薪                广州-黄埔区   \n",
       "20            市场产品经理/主管    10-15k·12薪                    广州   \n",
       "21             高级金融产品经理    20-35k·15薪                广州-南沙区   \n",
       "22                新媒体运营     8-15k·14薪                广州-花都区   \n",
       "23   内衣项目总经理 - 跨境电商出口行业    20-25k·12薪                广州-越秀区   \n",
       "24          产品顾问（多媒体方向）     8-12k·12薪                广州-花都区   \n",
       "25         产品顾问（地图导航方向）     8-12k·12薪                广州-花都区   \n",
       "26         产品顾问（车载语音方向）     8-12k·12薪                广州-花都区   \n",
       "27             产品经理（OA）    10-15k·12薪                广州-天河区   \n",
       "28              APP产品经理      2-3k·13薪                广州-荔湾区   \n",
       "29   APP产品经理（知识付费 在线教育）    10-15k·12薪                广州-白云区   \n",
       "30                 产品总监   80-100k·13薪  北京-东城区,广州-海珠区,上海-徐汇区   \n",
       "31               相机产品经理    25-45k·16薪                    广州   \n",
       "32             商家履约产品经理    25-40k·14薪                 广州,杭州   \n",
       "33             直播产品运营经理    15-25k·13薪                    广州   \n",
       "34               直播产品经理    20-25k·13薪                    广州   \n",
       "35  融捷健康智能电子公司-智能硬件产品经理            面议                    广州   \n",
       "36                 产品经理    20-40k·12薪                    广州   \n",
       "37              App产品经理    10-15k·13薪                    广州   \n",
       "38   APP产品经理（知识付费/在线教育）    15-25k·12薪                    广州   \n",
       "39   APP产品经理（知识付费/在线教育）    15-25k·12薪                广州-天河区   \n",
       "40   APP产品经理（知识付费/在线教育）    10-15k·12薪                广州-天河区   \n",
       "41                 数据产品    35-65k·16薪              北京,上海,广州   \n",
       "42                 产品经理    18-30k·14薪              北京,广州,杭州   \n",
       "43             商家履约产品经理    25-50k·18薪              广东,广州,杭州   \n",
       "44              WMS产品经理    25-50k·18薪              广东,广州,杭州   \n",
       "45              WMS产品经理    25-40k·14薪                    广州   \n",
       "46             商家履约产品经理    25-40k·14薪                 广州,浙江   \n",
       "47                 产品经理    12-24k·13薪                广州-天河区   \n",
       "48                 产品经理    15-30k·14薪                广州-海珠区   \n",
       "49         资深产品经理（相机产品）            面议                    广州   \n",
       "50         产品经理（用户体验改善）    15-25k·12薪                广州-天河区   \n",
       "51              APP产品经理    10-15k·13薪                广州-天河区   \n",
       "52   APP产品经理(知识付费/在线教育)    10-12k·13薪                广州-天河区   \n",
       "53               高级产品经理    20-30k·12薪                    广州   \n",
       "54             直播产品运营经理    20-30k·13薪                广州-天河区   \n",
       "55                 产品经理     9-13k·12薪                广州-海珠区   \n",
       "56                 产品经理    15-30k·12薪                 广州-车陂   \n",
       "57             国际支付产品专家            面议               广州-珠江新城   \n",
       "58                 产品经理    15-25k·12薪                广州-海珠区   \n",
       "59           产品经理（校园招聘）     8-12k·13薪                    广州   \n",
       "\n",
       "                    公司名称          时间          经验  \\\n",
       "0               融捷投资控股集团       22小时前   3年以上 统招本科   \n",
       "1         前海爱讯科技(深圳)有限公司          前天   2年以上 统招本科   \n",
       "2          广州大白互联网科技有限公司  2020-04-02  2年以上 本科及以上   \n",
       "3        广东卓志供应链服务集团有限公司  2020-04-01   5年以上 统招本科   \n",
       "4             湖南狐轩服饰有限公司  2020-03-31  8年以上 大专及以上   \n",
       "5          广东车海洋环保科技有限公司  2020-03-20  3年以上 大专及以上   \n",
       "6          广东南方新媒体股份有限公司        一个月前  3年以上 本科及以上   \n",
       "7                某软件开发企业        3小时前  3年以上 大专及以上   \n",
       "8               知名跨境电商公司        1小时前   3年以上 统招本科   \n",
       "9            某大型百强置业有限公司  2020-04-03   3年以上 统招本科   \n",
       "10        某互联网大数据云计算上市公司  2020-04-01   3年以上 统招本科   \n",
       "11             某知名跨境电商平台        一个月前   2年以上 学历不限   \n",
       "12      北京普瑞未来教育科技集团有限公司        一个月前  3年以上 大专及以上   \n",
       "13         北京路客互联网科技有限公司        一个月前  3年以上 本科及以上   \n",
       "14                  名创优品        一个月前   8年以上 统招本科   \n",
       "15         广州三易互联网科技有限公司        一个月前   经验不限 学历不限   \n",
       "16         广州三易互联网科技有限公司        一个月前  经验不限 本科及以上   \n",
       "17        广州易达建信科技开发有限公司        一个月前  1年以上 大专及以上   \n",
       "18        锦江信息技术(广州)有限公司        一个月前  5年以上 本科及以上   \n",
       "19                   健客网        一个月前  2年以上 本科及以上   \n",
       "20          北京仲惠信息咨询有限公司        1小时前  3年以上 本科及以上   \n",
       "21        暗物智能科技(广州)有限公司        2小时前  5年以上 本科及以上   \n",
       "22                  联友科技        3小时前  1年以上 本科及以上   \n",
       "23     文峰荟(广州)创业投资管理有限公司       59分钟前  10年以上 统招本科   \n",
       "24                  联友科技        3小时前  2年以上 本科及以上   \n",
       "25                  联友科技        3小时前  2年以上 本科及以上   \n",
       "26                  联友科技        3小时前  2年以上 本科及以上   \n",
       "27         佛山市艾臣家居科技有限公司        2小时前   3年以上 学历不限   \n",
       "28        某科技公司招聘APP产品经理        3小时前   5年以上 学历不限   \n",
       "29     YY- 在线教育知识付费广州某公司        4小时前  3年以上 本科及以上   \n",
       "30           某科技公司招聘产品总监        3小时前   5年以上 统招本科   \n",
       "31               某知名游戏公司        1小时前   5年以上 统招本科   \n",
       "32                 瀚特英咨询        1小时前  3年以上 本科及以上   \n",
       "33             香港某传播有限公司        1小时前  5年以上 本科及以上   \n",
       "34             香港某传播有限公司        1小时前   3年以上 统招本科   \n",
       "35              融捷投资控股集团       22小时前   5年以上 统招本科   \n",
       "36                  卓尔人人       15小时前   3年以上 统招本科   \n",
       "37               某科技有限公司       18小时前   3年以上 统招本科   \n",
       "38           知识付费在线教育APP          昨天   2年以上 统招本科   \n",
       "39           知识付费在线教育APP          昨天  3年以上 本科及以上   \n",
       "40           知识付费在线教育APP       19小时前  3年以上 本科及以上   \n",
       "41                某互联网企业          昨天   3年以上 统招本科   \n",
       "42              某大型互联网公司       18小时前  3年以上 大专及以上   \n",
       "43  某大型知名跨境电商平台准独角兽（C+轮）       21小时前   3年以上 统招本科   \n",
       "44  某大型知名跨境电商平台准独角兽（C+轮）       21小时前   3年以上 统招本科   \n",
       "45             hunter-in       19小时前  3年以上 本科及以上   \n",
       "46             hunter-in       19小时前  3年以上 本科及以上   \n",
       "47                   360          昨天  3年以上 本科及以上   \n",
       "48      信用生活(广州)智能科技有限公司          前天  3年以上 本科及以上   \n",
       "49                  网易集团          前天  5年以上 本科及以上   \n",
       "50                  嘟比英语          昨天   3年以上 统招本科   \n",
       "51               APP产品经理          昨天  3年以上 本科及以上   \n",
       "52                 互联网公司          昨天   经验不限 统招本科   \n",
       "53                中港合资传媒          昨天  5年以上 大专及以上   \n",
       "54                中港合资传媒          昨天   3年以上 统招本科   \n",
       "55          广州共享出行科技有限公司  2020-04-03  3年以上 本科及以上   \n",
       "56      信用生活(广州)智能科技有限公司  2020-04-03   3年以上 统招本科   \n",
       "57          高盛国际人事顾问有限公司          前天  经验不限 本科及以上   \n",
       "58          青木数字技术股份有限公司  2020-04-03   5年以上 统招本科   \n",
       "59                佳都新太科技  2020-04-03   经验不限 统招本科   \n",
       "\n",
       "                                           链结  \\\n",
       "0   https://m.liepin.com/job/1922705123.shtml   \n",
       "1   https://m.liepin.com/job/1926703515.shtml   \n",
       "2   https://m.liepin.com/job/1922402715.shtml   \n",
       "3   https://m.liepin.com/job/1924985573.shtml   \n",
       "4   https://m.liepin.com/job/1927101491.shtml   \n",
       "5   https://m.liepin.com/job/1917453193.shtml   \n",
       "6   https://m.liepin.com/job/1925126353.shtml   \n",
       "7       https://m.liepin.com/a/18948933.shtml   \n",
       "8       https://m.liepin.com/a/18705133.shtml   \n",
       "9       https://m.liepin.com/a/19609335.shtml   \n",
       "10      https://m.liepin.com/a/19603323.shtml   \n",
       "11      https://m.liepin.com/a/18963147.shtml   \n",
       "12  https://m.liepin.com/job/1919835727.shtml   \n",
       "13  https://m.liepin.com/job/1917750895.shtml   \n",
       "14  https://m.liepin.com/job/1925389277.shtml   \n",
       "15  https://m.liepin.com/job/1922364281.shtml   \n",
       "16  https://m.liepin.com/job/1922356557.shtml   \n",
       "17  https://m.liepin.com/job/1919464529.shtml   \n",
       "18  https://m.liepin.com/job/1919024715.shtml   \n",
       "19  https://m.liepin.com/job/1914662183.shtml   \n",
       "20  https://m.liepin.com/job/1927235433.shtml   \n",
       "21  https://m.liepin.com/job/1926378687.shtml   \n",
       "22  https://m.liepin.com/job/1925693659.shtml   \n",
       "23  https://m.liepin.com/job/1920988285.shtml   \n",
       "24  https://m.liepin.com/job/1920316169.shtml   \n",
       "25  https://m.liepin.com/job/1920316167.shtml   \n",
       "26  https://m.liepin.com/job/1920316163.shtml   \n",
       "27  https://m.liepin.com/job/1919955237.shtml   \n",
       "28      https://m.liepin.com/a/19595503.shtml   \n",
       "29      https://m.liepin.com/a/19570073.shtml   \n",
       "30      https://m.liepin.com/a/19420579.shtml   \n",
       "31      https://m.liepin.com/a/18834745.shtml   \n",
       "32      https://m.liepin.com/a/18744551.shtml   \n",
       "33      https://m.liepin.com/a/18725599.shtml   \n",
       "34      https://m.liepin.com/a/18562407.shtml   \n",
       "35  https://m.liepin.com/job/1926705941.shtml   \n",
       "36  https://m.liepin.com/job/1926419233.shtml   \n",
       "37      https://m.liepin.com/a/19594403.shtml   \n",
       "38      https://m.liepin.com/a/19548311.shtml   \n",
       "39      https://m.liepin.com/a/19524533.shtml   \n",
       "40      https://m.liepin.com/a/19483901.shtml   \n",
       "41      https://m.liepin.com/a/19465809.shtml   \n",
       "42      https://m.liepin.com/a/19415683.shtml   \n",
       "43      https://m.liepin.com/a/19371747.shtml   \n",
       "44      https://m.liepin.com/a/19370965.shtml   \n",
       "45      https://m.liepin.com/a/18744427.shtml   \n",
       "46      https://m.liepin.com/a/18744381.shtml   \n",
       "47  https://m.liepin.com/job/1927232311.shtml   \n",
       "48  https://m.liepin.com/job/1927228991.shtml   \n",
       "49  https://m.liepin.com/job/1926534703.shtml   \n",
       "50  https://m.liepin.com/job/1926412121.shtml   \n",
       "51      https://m.liepin.com/a/19601195.shtml   \n",
       "52      https://m.liepin.com/a/19595423.shtml   \n",
       "53      https://m.liepin.com/a/18725697.shtml   \n",
       "54      https://m.liepin.com/a/18711143.shtml   \n",
       "55  https://m.liepin.com/job/1927204563.shtml   \n",
       "56  https://m.liepin.com/job/1927200673.shtml   \n",
       "57  https://m.liepin.com/job/1927186127.shtml   \n",
       "58  https://m.liepin.com/job/1927082439.shtml   \n",
       "59  https://m.liepin.com/job/1927075137.shtml   \n",
       "\n",
       "                                     公司URL  \n",
       "0    https://m.liepin.com/company/8025674/  \n",
       "1    https://m.liepin.com/company/8972310/  \n",
       "2    https://m.liepin.com/company/8695948/  \n",
       "3    https://m.liepin.com/company/9238204/  \n",
       "4   https://m.liepin.com/company/10202635/  \n",
       "5    https://m.liepin.com/company/9256869/  \n",
       "6    https://m.liepin.com/company/7889168/  \n",
       "7                                           \n",
       "8                                           \n",
       "9                                           \n",
       "10                                          \n",
       "11                                          \n",
       "12   https://m.liepin.com/company/9989029/  \n",
       "13   https://m.liepin.com/company/9284656/  \n",
       "14   https://m.liepin.com/company/8392675/  \n",
       "15   https://m.liepin.com/company/9647941/  \n",
       "16   https://m.liepin.com/company/9647941/  \n",
       "17   https://m.liepin.com/company/5493174/  \n",
       "18   https://m.liepin.com/company/8973053/  \n",
       "19    https://m.liepin.com/company/582047/  \n",
       "20  https://m.liepin.com/company/12198453/  \n",
       "21   https://m.liepin.com/company/9671912/  \n",
       "22   https://m.liepin.com/company/1680117/  \n",
       "23   https://m.liepin.com/company/9412628/  \n",
       "24   https://m.liepin.com/company/1680117/  \n",
       "25   https://m.liepin.com/company/1680117/  \n",
       "26   https://m.liepin.com/company/1680117/  \n",
       "27   https://m.liepin.com/company/9220328/  \n",
       "28                                          \n",
       "29                                          \n",
       "30                                          \n",
       "31                                          \n",
       "32                                          \n",
       "33                                          \n",
       "34                                          \n",
       "35   https://m.liepin.com/company/8025674/  \n",
       "36  https://m.liepin.com/company/12146335/  \n",
       "37                                          \n",
       "38                                          \n",
       "39                                          \n",
       "40                                          \n",
       "41                                          \n",
       "42                                          \n",
       "43                                          \n",
       "44                                          \n",
       "45                                          \n",
       "46                                          \n",
       "47   https://m.liepin.com/company/6429309/  \n",
       "48   https://m.liepin.com/company/9512616/  \n",
       "49   https://m.liepin.com/company/5964833/  \n",
       "50  https://m.liepin.com/company/12166375/  \n",
       "51                                          \n",
       "52                                          \n",
       "53                                          \n",
       "54                                          \n",
       "55  https://m.liepin.com/company/10118191/  \n",
       "56   https://m.liepin.com/company/9512616/  \n",
       "57   https://m.liepin.com/company/9766571/  \n",
       "58  https://m.liepin.com/company/12191983/  \n",
       "59   https://m.liepin.com/company/2115085/  "
      ]
     },
     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# C-1   Single page\n",
    "url = \"https://m.liepin.com/zhaopin/?keyword=PRD\"\n",
    "session = HTMLSession()\n",
    "r = session.get( url )\n",
    "\n",
    "# C-5\n",
    "# Hard fields: '公司URL' (company URL), '时间' (posting time), '经验' (experience)\n",
    "\n",
    "# Select the job-card elements first, then query only their descendants\n",
    "主要元素 = r.html.xpath( \\\n",
    "    '//div[@class=\"job-card-wrap\"]//div[@class=\"job-card\"]')\n",
    "\n",
    "# XPath dictionary, grouped by extraction method: keys are the field names to extract, values are XPath expressions\n",
    "dict_xpaths={ \n",
    "    'text': {\n",
    "        '经验':      './/ul/li[time]/text()'\n",
    "    },\n",
    "    'text_content': {\n",
    "        '职称':    './/ul/li/a[contains(@class,\"job-name\")]/span[@class=\"name-text\"]', \n",
    "        '薪水':    './/ul/li/a[contains(@class,\"job-name\")]/following-sibling::span', \n",
    "        '公司地点':'.//ul/li/time/following-sibling::a',\n",
    "        '公司名称': './/ul/li/a[contains(@class,\"company-name\")]', \n",
    "        '时间':    './/ul/li/time', \n",
    "    },\n",
    "    'href': {\n",
    "        '链结':    './/ul/li/a[contains(@class,\"job-name\")]', \n",
    "        '公司URL': './/ul/li/a[contains(@class,\"company-name\")]', \n",
    "    }\n",
    "}\n",
    "\n",
    "def get_e_text_content(_xpath_):\n",
    "    # List comprehension: text_content() of the first match inside each job card\n",
    "    暂存结果 = [e.xpath(_xpath_)[0].lxml.text_content() for e in 主要元素]\n",
    "    return(暂存结果)\n",
    "\n",
    "def get_e_text(_xpath_):\n",
    "    # List comprehension: join the stripped text nodes matched inside each job card\n",
    "    暂存结果 = [\"\".join([x.strip() for x in e.xpath(_xpath_)]) for e in 主要元素]\n",
    "    return(暂存结果)\n",
    "\n",
    "def get_e_href(_xpath_):\n",
    "    # List comprehension: first absolute link of the match in each job card, or \"\" if it has none\n",
    "    暂存结果 = [list(e.xpath(_xpath_, first=True).absolute_links)[0] \\\n",
    "               if len(e.xpath(_xpath_, first=True).absolute_links) >= 1  \\\n",
    "               else \"\" for e in 主要元素]\n",
    "    return(暂存结果)\n",
    "\n",
    "# Evaluate every XPath only within the job-card elements\n",
    "数据字典 = dict()\n",
    "\n",
    "数据字典 = {k:get_e_text_content(v) for k,v in dict_xpaths['text_content'].items()}\n",
    "数据字典.update({k:get_e_text(v) for k,v in dict_xpaths['text'].items()})\n",
    "数据字典.update({k:get_e_href(v) for k,v in dict_xpaths['href'].items()})\n",
    "\n",
    "print ([len(v) for k,v in 数据字典.items()])  # Sanity check: every column should have the same length\n",
    "\n",
    "数据 = pd.DataFrame(数据字典)\n",
    "数据.to_excel(\"20春_Web数据挖掘_week02_liepin.xlsx\", sheet_name=\"搜查结果\")\n",
    "数据 "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "[60, 60, 60, 60, 60, 60, 60, 60]\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>职称</th>\n",
       "      <th>薪水</th>\n",
       "      <th>公司地点</th>\n",
       "      <th>公司名称</th>\n",
       "      <th>时间</th>\n",
       "      <th>经验</th>\n",
       "      <th>链结</th>\n",
       "      <th>公司URL</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <td>0</td>\n",
       "      <td>网络在线/课程咨询师/美女团队/带薪培训S</td>\n",
       "      <td>6-12k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>深圳市升学文化传播有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>经验不限 中专/中技及以上</td>\n",
       "      <td>https://m.liepin.com/job/1919198869.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8770412/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>1</td>\n",
       "      <td>人才发展主管</td>\n",
       "      <td>面议</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>2年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1925813851.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>2</td>\n",
       "      <td>HRBP主管/经理</td>\n",
       "      <td>12-20k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>5年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926656193.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>3</td>\n",
       "      <td>SEM广告投放（Facebook/Google）</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926507605.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>4</td>\n",
       "      <td>服装买手</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926507563.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>5</td>\n",
       "      <td>海外广告投放经理（FB/GG）</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925681279.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>6</td>\n",
       "      <td>网站运营经理</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925680965.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>7</td>\n",
       "      <td>商品经理（女装/鞋子）</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925680857.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>8</td>\n",
       "      <td>买手经理（女装/鞋子）</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925680341.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>9</td>\n",
       "      <td>鞋子设计师</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1921029797.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>10</td>\n",
       "      <td>服装开发</td>\n",
       "      <td>6-8k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>2年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920093991.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>11</td>\n",
       "      <td>女装设计师</td>\n",
       "      <td>7-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>2年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920093987.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>12</td>\n",
       "      <td>服装买手</td>\n",
       "      <td>4-8k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>经验不限 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920093971.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>13</td>\n",
       "      <td>Facebook投放经理</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>5年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920093945.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>14</td>\n",
       "      <td>销售代表</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>广州-新市</td>\n",
       "      <td>济南邦德激光股份有限公司</td>\n",
       "      <td>2020-04-02</td>\n",
       "      <td>2年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926896949.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8734171/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>15</td>\n",
       "      <td>摄影师</td>\n",
       "      <td>7-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-12</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926359141.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>16</td>\n",
       "      <td>HRBP(服装/互联网行业）</td>\n",
       "      <td>12-20k·13薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-12</td>\n",
       "      <td>5年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925832523.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>17</td>\n",
       "      <td>高级招聘专员</td>\n",
       "      <td>5-8k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925721417.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>18</td>\n",
       "      <td>服饰采购经理</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>4年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1923836679.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>19</td>\n",
       "      <td>外贸业务员（包食宿）</td>\n",
       "      <td>4-8k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广东依莱雅纺织有限公司</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1922582861.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9376258/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>20</td>\n",
       "      <td>新媒体运营专员</td>\n",
       "      <td>3-6k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广东优冠生物</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>经验不限 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920896175.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8845857/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>21</td>\n",
       "      <td>文案专员</td>\n",
       "      <td>3-6k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广东优冠生物</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>经验不限 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920895893.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8845857/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>22</td>\n",
       "      <td>面辅料采购</td>\n",
       "      <td>4-6k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>细刻科技</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>1年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1917413835.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8599903/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>23</td>\n",
       "      <td>买手主管/经理</td>\n",
       "      <td>8-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>细刻科技</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1916673777.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8599903/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>24</td>\n",
       "      <td>阿里巴巴运营</td>\n",
       "      <td>8-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>细刻科技</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>3年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1916376163.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8599903/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>25</td>\n",
       "      <td>供应链高级经理</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>细刻科技</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>5年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1916376159.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8599903/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>26</td>\n",
       "      <td>用户运营专员</td>\n",
       "      <td>8-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>细刻科技</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>1年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1916126737.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8599903/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>27</td>\n",
       "      <td>php开发工程师</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>细刻科技</td>\n",
       "      <td>一个月前</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1915946645.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8599903/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>28</td>\n",
       "      <td>Javascript</td>\n",
       "      <td>7-12k·12薪</td>\n",
       "      <td>广州-花都区</td>\n",
       "      <td>碧曼雅国际美容养生馆(广州)有限公司</td>\n",
       "      <td>3小时前</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926806587.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10138657/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>29</td>\n",
       "      <td>银行保险续保专员</td>\n",
       "      <td>面议</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>中国人寿</td>\n",
       "      <td>5小时前</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1919886121.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9235779/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>30</td>\n",
       "      <td>客户营销主管</td>\n",
       "      <td>6-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>喀斯玛(北京)科技有限公司</td>\n",
       "      <td>昨天</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927210955.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10100863/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>31</td>\n",
       "      <td>B2B营销主管</td>\n",
       "      <td>6-8k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>喀斯玛(北京)科技有限公司</td>\n",
       "      <td>昨天</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927201035.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10100863/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>32</td>\n",
       "      <td>客户服务专员/助理</td>\n",
       "      <td>5-10k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>广州贝思教育科技有限公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>经验不限 中专/中技及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926514447.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10094505/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>33</td>\n",
       "      <td>法务专员</td>\n",
       "      <td>6-8k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>1年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1925704905.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>34</td>\n",
       "      <td>中级java开发工程师</td>\n",
       "      <td>7-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>高亚科技</td>\n",
       "      <td>昨天</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1922252199.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1552347/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>35</td>\n",
       "      <td>销售代表</td>\n",
       "      <td>6-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>宏飞国际投资咨询(深圳)有限公司广州分公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>经验不限 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1922108027.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10011421/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>36</td>\n",
       "      <td>理财规划师管理培训生</td>\n",
       "      <td>6-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>宏飞国际投资咨询(深圳)有限公司广州分公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>经验不限 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1920427401.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10011421/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>37</td>\n",
       "      <td>Financial Planner</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>宏飞国际投资咨询(深圳)有限公司广州分公司</td>\n",
       "      <td>20小时前</td>\n",
       "      <td>经验不限 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920427347.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10011421/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>38</td>\n",
       "      <td>English support（英语支持）</td>\n",
       "      <td>4-8k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>高亚科技</td>\n",
       "      <td>昨天</td>\n",
       "      <td>1年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1919214919.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1552347/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>39</td>\n",
       "      <td>供应链产品经理</td>\n",
       "      <td>12-18k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>23小时前</td>\n",
       "      <td>4年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1918756491.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>40</td>\n",
       "      <td>Java开发工程师</td>\n",
       "      <td>6-8k·13薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>高亚科技</td>\n",
       "      <td>昨天</td>\n",
       "      <td>1年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/1918716743.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1552347/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>41</td>\n",
       "      <td>电话销售</td>\n",
       "      <td>5-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州益嘉科技服务有限公司</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>2年以上 中专/中技及以上</td>\n",
       "      <td>https://m.liepin.com/job/1927202629.shtml</td>\n",
       "      <td>https://m.liepin.com/company/12197319/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>42</td>\n",
       "      <td>大客户经理</td>\n",
       "      <td>5-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州共享出行科技有限公司</td>\n",
       "      <td>2020-04-03</td>\n",
       "      <td>2年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926157057.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10118191/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>43</td>\n",
       "      <td>课程顾问\\无需外出\\五险一金\\不加班</td>\n",
       "      <td>6-12k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>深圳市中教文化传播有限公司广州分公司</td>\n",
       "      <td>前天</td>\n",
       "      <td>经验不限 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1924886091.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8797760/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>44</td>\n",
       "      <td>不加班诚聘课程顾问</td>\n",
       "      <td>6-10k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>深圳市中教文化传播有限公司广州分公司</td>\n",
       "      <td>前天</td>\n",
       "      <td>1年以上 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1923463091.shtml</td>\n",
       "      <td>https://m.liepin.com/company/8797760/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>45</td>\n",
       "      <td>营销推广策划</td>\n",
       "      <td>8-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>四三九九信息科技</td>\n",
       "      <td>前天</td>\n",
       "      <td>2年以上 统招本科</td>\n",
       "      <td>https://m.liepin.com/job/197482129.shtml</td>\n",
       "      <td>https://m.liepin.com/company/1040016/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>46</td>\n",
       "      <td>项目经理</td>\n",
       "      <td>8-15k·12薪</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>房多多</td>\n",
       "      <td>2020-04-01</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1924333215.shtml</td>\n",
       "      <td>https://m.liepin.com/company/7871137/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>47</td>\n",
       "      <td>新房通产品运营部总（副）总经理(J10573)</td>\n",
       "      <td>30-50k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>房多多</td>\n",
       "      <td>2020-04-01</td>\n",
       "      <td>10年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1924200717.shtml</td>\n",
       "      <td>https://m.liepin.com/company/7871137/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>48</td>\n",
       "      <td>房地产交易/中介</td>\n",
       "      <td>4-9k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州华江置业有限公司</td>\n",
       "      <td>2020-03-27</td>\n",
       "      <td>经验不限 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1926972993.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9822899/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>49</td>\n",
       "      <td>主播经纪人</td>\n",
       "      <td>5-8k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州凯乐网络科技有限公司</td>\n",
       "      <td>2020-03-26</td>\n",
       "      <td>经验不限 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1926982701.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9735493/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>50</td>\n",
       "      <td>测试工程师</td>\n",
       "      <td>11-20k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-26</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926982197.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>51</td>\n",
       "      <td>电话销售</td>\n",
       "      <td>10-15k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>广州华江置业有限公司</td>\n",
       "      <td>2020-03-26</td>\n",
       "      <td>经验不限 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1926973067.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9822899/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>52</td>\n",
       "      <td>经理助理（女上司）</td>\n",
       "      <td>4-5k·12薪</td>\n",
       "      <td>广州-太和</td>\n",
       "      <td>广州百印电子有限公司</td>\n",
       "      <td>2020-03-26</td>\n",
       "      <td>经验不限 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1923720107.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10179021/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>53</td>\n",
       "      <td>外贸专员</td>\n",
       "      <td>4-8k·12薪</td>\n",
       "      <td>广州-白云区</td>\n",
       "      <td>广州百印电子有限公司</td>\n",
       "      <td>2020-03-26</td>\n",
       "      <td>1年以上 学历不限</td>\n",
       "      <td>https://m.liepin.com/job/1923038999.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10179021/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>54</td>\n",
       "      <td>PHP开发工程师</td>\n",
       "      <td>12-20k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-25</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926620051.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>55</td>\n",
       "      <td>销售主管</td>\n",
       "      <td>20-25k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>中国人寿</td>\n",
       "      <td>2020-03-23</td>\n",
       "      <td>1年以上 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1920145379.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9813547/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>56</td>\n",
       "      <td>销售经理</td>\n",
       "      <td>7-8k·12薪</td>\n",
       "      <td>广州</td>\n",
       "      <td>上海玺智汽车租赁有限公司</td>\n",
       "      <td>2020-03-21</td>\n",
       "      <td>经验不限 大专及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926847725.shtml</td>\n",
       "      <td>https://m.liepin.com/company/10091189/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>57</td>\n",
       "      <td>前端开发工程师</td>\n",
       "      <td>12-20k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-12</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1926620141.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>58</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-12</td>\n",
       "      <td>3年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1925228971.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>59</td>\n",
       "      <td>供应链产品经理</td>\n",
       "      <td>20-25k·12薪</td>\n",
       "      <td>广州-天河区</td>\n",
       "      <td>广州细刻网络科技有限公司</td>\n",
       "      <td>2020-03-12</td>\n",
       "      <td>4年以上 本科及以上</td>\n",
       "      <td>https://m.liepin.com/job/1924389209.shtml</td>\n",
       "      <td>https://m.liepin.com/company/9911671/</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                           职称          薪水    公司地点                   公司名称  \\\n",
       "0      网络在线/课程咨询师/美女团队/带薪培训S    6-12k·12薪      广州          深圳市升学文化传播有限公司   \n",
       "1                     人才发展主管           面议  广州-天河区           广州细刻网络科技有限公司   \n",
       "2                  HRBP主管/经理   12-20k·13薪      广州           广州细刻网络科技有限公司   \n",
       "3   SEM广告投放（Facebook/Google）    8-12k·12薪      广州           广州细刻网络科技有限公司   \n",
       "4                       服装买手    8-12k·12薪      广州           广州细刻网络科技有限公司   \n",
       "5            海外广告投放经理（FB/GG）   10-15k·12薪      广州           广州细刻网络科技有限公司   \n",
       "6                     网站运营经理   10-15k·12薪      广州           广州细刻网络科技有限公司   \n",
       "7                商品经理（女装/鞋子）   10-15k·12薪      广州           广州细刻网络科技有限公司   \n",
       "8                买手经理（女装/鞋子）   10-15k·12薪      广州           广州细刻网络科技有限公司   \n",
       "9                      鞋子设计师    8-12k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "10                      服装开发     6-8k·12薪      广州           广州细刻网络科技有限公司   \n",
       "11                     女装设计师    7-10k·12薪      广州           广州细刻网络科技有限公司   \n",
       "12                      服装买手     4-8k·12薪      广州           广州细刻网络科技有限公司   \n",
       "13              Facebook投放经理   15-25k·12薪      广州           广州细刻网络科技有限公司   \n",
       "14                      销售代表   10-20k·12薪   广州-新市           济南邦德激光股份有限公司   \n",
       "15                       摄影师    7-10k·12薪      广州           广州细刻网络科技有限公司   \n",
       "16            HRBP(服装/互联网行业）   12-20k·13薪      广州           广州细刻网络科技有限公司   \n",
       "17                    高级招聘专员     5-8k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "18                    服饰采购经理   10-15k·12薪      广州           广州细刻网络科技有限公司   \n",
       "19                外贸业务员（包食宿）     4-8k·12薪      广州            广东依莱雅纺织有限公司   \n",
       "20                   新媒体运营专员     3-6k·12薪      广州                 广东优冠生物   \n",
       "21                      文案专员     3-6k·12薪      广州                 广东优冠生物   \n",
       "22                     面辅料采购     4-6k·12薪      广州                   细刻科技   \n",
       "23                   买手主管/经理    8-15k·12薪      广州                   细刻科技   \n",
       "24                    阿里巴巴运营    8-15k·12薪      广州                   细刻科技   \n",
       "25                   供应链高级经理   15-25k·12薪      广州                   细刻科技   \n",
       "26                    用户运营专员    8-10k·12薪      广州                   细刻科技   \n",
       "27                  php开发工程师   10-15k·12薪      广州                   细刻科技   \n",
       "28                Javascript    7-12k·12薪  广州-花都区     碧曼雅国际美容养生馆(广州)有限公司   \n",
       "29                  银行保险续保专员           面议  广州-天河区                   中国人寿   \n",
       "30                    客户营销主管    6-10k·12薪      广州          喀斯玛(北京)科技有限公司   \n",
       "31                   B2B营销主管     6-8k·12薪      广州          喀斯玛(北京)科技有限公司   \n",
       "32                 客户服务专员/助理    5-10k·12薪  广州-海珠区           广州贝思教育科技有限公司   \n",
       "33                      法务专员     6-8k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "34               中级java开发工程师    7-10k·12薪      广州                   高亚科技   \n",
       "35                      销售代表    6-10k·12薪      广州  宏飞国际投资咨询(深圳)有限公司广州分公司   \n",
       "36                理财规划师管理培训生    6-10k·12薪      广州  宏飞国际投资咨询(深圳)有限公司广州分公司   \n",
       "37         Financial Planner    8-12k·12薪      广州  宏飞国际投资咨询(深圳)有限公司广州分公司   \n",
       "38     English support（英语支持）     4-8k·12薪  广州-天河区                   高亚科技   \n",
       "39                   供应链产品经理   12-18k·12薪      广州           广州细刻网络科技有限公司   \n",
       "40                 Java开发工程师     6-8k·13薪  广州-天河区                   高亚科技   \n",
       "41                      电话销售    5-10k·12薪      广州           广州益嘉科技服务有限公司   \n",
       "42                     大客户经理    5-10k·12薪      广州           广州共享出行科技有限公司   \n",
       "43        课程顾问\\无需外出\\五险一金\\不加班    6-12k·12薪      广州     深圳市中教文化传播有限公司广州分公司   \n",
       "44                 不加班诚聘课程顾问    6-10k·12薪      广州     深圳市中教文化传播有限公司广州分公司   \n",
       "45                    营销推广策划    8-15k·12薪      广州               四三九九信息科技   \n",
       "46                      项目经理    8-15k·12薪  广州-海珠区                    房多多   \n",
       "47   新房通产品运营部总（副）总经理(J10573)   30-50k·12薪      广州                    房多多   \n",
       "48                  房地产交易/中介     4-9k·12薪      广州             广州华江置业有限公司   \n",
       "49                     主播经纪人     5-8k·12薪      广州           广州凯乐网络科技有限公司   \n",
       "50                     测试工程师   11-20k·12薪      广州           广州细刻网络科技有限公司   \n",
       "51                      电话销售   10-15k·12薪      广州             广州华江置业有限公司   \n",
       "52                 经理助理（女上司）     4-5k·12薪   广州-太和             广州百印电子有限公司   \n",
       "53                      外贸专员     4-8k·12薪  广州-白云区             广州百印电子有限公司   \n",
       "54                  PHP开发工程师   12-20k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "55                      销售主管   20-25k·12薪      广州                   中国人寿   \n",
       "56                      销售经理     7-8k·12薪      广州           上海玺智汽车租赁有限公司   \n",
       "57                   前端开发工程师   12-20k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "58                      产品经理   15-25k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "59                   供应链产品经理   20-25k·12薪  广州-天河区           广州细刻网络科技有限公司   \n",
       "\n",
       "            时间             经验                                         链结  \\\n",
       "0         一个月前  经验不限 中专/中技及以上  https://m.liepin.com/job/1919198869.shtml   \n",
       "1         3小时前      2年以上 统招本科  https://m.liepin.com/job/1925813851.shtml   \n",
       "2        23小时前     5年以上 大专及以上  https://m.liepin.com/job/1926656193.shtml   \n",
       "3        20小时前     1年以上 大专及以上  https://m.liepin.com/job/1926507605.shtml   \n",
       "4        20小时前     1年以上 大专及以上  https://m.liepin.com/job/1926507563.shtml   \n",
       "5        20小时前     5年以上 本科及以上  https://m.liepin.com/job/1925681279.shtml   \n",
       "6        20小时前     5年以上 本科及以上  https://m.liepin.com/job/1925680965.shtml   \n",
       "7        20小时前     5年以上 本科及以上  https://m.liepin.com/job/1925680857.shtml   \n",
       "8        20小时前     5年以上 本科及以上  https://m.liepin.com/job/1925680341.shtml   \n",
       "9        23小时前     3年以上 大专及以上  https://m.liepin.com/job/1921029797.shtml   \n",
       "10       23小时前     2年以上 大专及以上  https://m.liepin.com/job/1920093991.shtml   \n",
       "11       23小时前     2年以上 大专及以上  https://m.liepin.com/job/1920093987.shtml   \n",
       "12       23小时前     经验不限 大专及以上  https://m.liepin.com/job/1920093971.shtml   \n",
       "13       23小时前     5年以上 大专及以上  https://m.liepin.com/job/1920093945.shtml   \n",
       "14  2020-04-02     2年以上 大专及以上  https://m.liepin.com/job/1926896949.shtml   \n",
       "15  2020-03-12     3年以上 本科及以上  https://m.liepin.com/job/1926359141.shtml   \n",
       "16  2020-03-12     5年以上 本科及以上  https://m.liepin.com/job/1925832523.shtml   \n",
       "17        一个月前     2年以上 本科及以上  https://m.liepin.com/job/1925721417.shtml   \n",
       "18        一个月前     4年以上 大专及以上  https://m.liepin.com/job/1923836679.shtml   \n",
       "19        一个月前     1年以上 大专及以上  https://m.liepin.com/job/1922582861.shtml   \n",
       "20        一个月前     经验不限 大专及以上  https://m.liepin.com/job/1920896175.shtml   \n",
       "21        一个月前     经验不限 大专及以上  https://m.liepin.com/job/1920895893.shtml   \n",
       "22        一个月前     1年以上 本科及以上  https://m.liepin.com/job/1917413835.shtml   \n",
       "23        一个月前     3年以上 大专及以上  https://m.liepin.com/job/1916673777.shtml   \n",
       "24        一个月前     3年以上 大专及以上  https://m.liepin.com/job/1916376163.shtml   \n",
       "25        一个月前     5年以上 大专及以上  https://m.liepin.com/job/1916376159.shtml   \n",
       "26        一个月前     1年以上 本科及以上  https://m.liepin.com/job/1916126737.shtml   \n",
       "27        一个月前     3年以上 本科及以上  https://m.liepin.com/job/1915946645.shtml   \n",
       "28        3小时前     2年以上 本科及以上  https://m.liepin.com/job/1926806587.shtml   \n",
       "29        5小时前     1年以上 大专及以上  https://m.liepin.com/job/1919886121.shtml   \n",
       "30          昨天     1年以上 大专及以上  https://m.liepin.com/job/1927210955.shtml   \n",
       "31          昨天     1年以上 大专及以上  https://m.liepin.com/job/1927201035.shtml   \n",
       "32       20小时前  经验不限 中专/中技及以上  https://m.liepin.com/job/1926514447.shtml   \n",
       "33       23小时前      1年以上 统招本科  https://m.liepin.com/job/1925704905.shtml   \n",
       "34          昨天     3年以上 本科及以上  https://m.liepin.com/job/1922252199.shtml   \n",
       "35       20小时前     经验不限 大专及以上  https://m.liepin.com/job/1922108027.shtml   \n",
       "36       20小时前      经验不限 学历不限  https://m.liepin.com/job/1920427401.shtml   \n",
       "37       20小时前     经验不限 本科及以上  https://m.liepin.com/job/1920427347.shtml   \n",
       "38          昨天     1年以上 本科及以上  https://m.liepin.com/job/1919214919.shtml   \n",
       "39       23小时前     4年以上 大专及以上  https://m.liepin.com/job/1918756491.shtml   \n",
       "40          昨天      1年以上 统招本科  https://m.liepin.com/job/1918716743.shtml   \n",
       "41  2020-04-03  2年以上 中专/中技及以上  https://m.liepin.com/job/1927202629.shtml   \n",
       "42  2020-04-03     2年以上 本科及以上  https://m.liepin.com/job/1926157057.shtml   \n",
       "43          前天      经验不限 学历不限  https://m.liepin.com/job/1924886091.shtml   \n",
       "44          前天      1年以上 学历不限  https://m.liepin.com/job/1923463091.shtml   \n",
       "45          前天      2年以上 统招本科   https://m.liepin.com/job/197482129.shtml   \n",
       "46  2020-04-01     3年以上 本科及以上  https://m.liepin.com/job/1924333215.shtml   \n",
       "47  2020-04-01    10年以上 本科及以上  https://m.liepin.com/job/1924200717.shtml   \n",
       "48  2020-03-27      经验不限 学历不限  https://m.liepin.com/job/1926972993.shtml   \n",
       "49  2020-03-26      经验不限 学历不限  https://m.liepin.com/job/1926982701.shtml   \n",
       "50  2020-03-26     3年以上 本科及以上  https://m.liepin.com/job/1926982197.shtml   \n",
       "51  2020-03-26      经验不限 学历不限  https://m.liepin.com/job/1926973067.shtml   \n",
       "52  2020-03-26     经验不限 大专及以上  https://m.liepin.com/job/1923720107.shtml   \n",
       "53  2020-03-26      1年以上 学历不限  https://m.liepin.com/job/1923038999.shtml   \n",
       "54  2020-03-25     3年以上 本科及以上  https://m.liepin.com/job/1926620051.shtml   \n",
       "55  2020-03-23     1年以上 大专及以上  https://m.liepin.com/job/1920145379.shtml   \n",
       "56  2020-03-21     经验不限 大专及以上  https://m.liepin.com/job/1926847725.shtml   \n",
       "57  2020-03-12     3年以上 本科及以上  https://m.liepin.com/job/1926620141.shtml   \n",
       "58  2020-03-12     3年以上 本科及以上  https://m.liepin.com/job/1925228971.shtml   \n",
       "59  2020-03-12     4年以上 本科及以上  https://m.liepin.com/job/1924389209.shtml   \n",
       "\n",
       "                                     公司URL  \n",
       "0    https://m.liepin.com/company/8770412/  \n",
       "1    https://m.liepin.com/company/9911671/  \n",
       "2    https://m.liepin.com/company/9911671/  \n",
       "3    https://m.liepin.com/company/9911671/  \n",
       "4    https://m.liepin.com/company/9911671/  \n",
       "5    https://m.liepin.com/company/9911671/  \n",
       "6    https://m.liepin.com/company/9911671/  \n",
       "7    https://m.liepin.com/company/9911671/  \n",
       "8    https://m.liepin.com/company/9911671/  \n",
       "9    https://m.liepin.com/company/9911671/  \n",
       "10   https://m.liepin.com/company/9911671/  \n",
       "11   https://m.liepin.com/company/9911671/  \n",
       "12   https://m.liepin.com/company/9911671/  \n",
       "13   https://m.liepin.com/company/9911671/  \n",
       "14   https://m.liepin.com/company/8734171/  \n",
       "15   https://m.liepin.com/company/9911671/  \n",
       "16   https://m.liepin.com/company/9911671/  \n",
       "17   https://m.liepin.com/company/9911671/  \n",
       "18   https://m.liepin.com/company/9911671/  \n",
       "19   https://m.liepin.com/company/9376258/  \n",
       "20   https://m.liepin.com/company/8845857/  \n",
       "21   https://m.liepin.com/company/8845857/  \n",
       "22   https://m.liepin.com/company/8599903/  \n",
       "23   https://m.liepin.com/company/8599903/  \n",
       "24   https://m.liepin.com/company/8599903/  \n",
       "25   https://m.liepin.com/company/8599903/  \n",
       "26   https://m.liepin.com/company/8599903/  \n",
       "27   https://m.liepin.com/company/8599903/  \n",
       "28  https://m.liepin.com/company/10138657/  \n",
       "29   https://m.liepin.com/company/9235779/  \n",
       "30  https://m.liepin.com/company/10100863/  \n",
       "31  https://m.liepin.com/company/10100863/  \n",
       "32  https://m.liepin.com/company/10094505/  \n",
       "33   https://m.liepin.com/company/9911671/  \n",
       "34   https://m.liepin.com/company/1552347/  \n",
       "35  https://m.liepin.com/company/10011421/  \n",
       "36  https://m.liepin.com/company/10011421/  \n",
       "37  https://m.liepin.com/company/10011421/  \n",
       "38   https://m.liepin.com/company/1552347/  \n",
       "39   https://m.liepin.com/company/9911671/  \n",
       "40   https://m.liepin.com/company/1552347/  \n",
       "41  https://m.liepin.com/company/12197319/  \n",
       "42  https://m.liepin.com/company/10118191/  \n",
       "43   https://m.liepin.com/company/8797760/  \n",
       "44   https://m.liepin.com/company/8797760/  \n",
       "45   https://m.liepin.com/company/1040016/  \n",
       "46   https://m.liepin.com/company/7871137/  \n",
       "47   https://m.liepin.com/company/7871137/  \n",
       "48   https://m.liepin.com/company/9822899/  \n",
       "49   https://m.liepin.com/company/9735493/  \n",
       "50   https://m.liepin.com/company/9911671/  \n",
       "51   https://m.liepin.com/company/9822899/  \n",
       "52  https://m.liepin.com/company/10179021/  \n",
       "53  https://m.liepin.com/company/10179021/  \n",
       "54   https://m.liepin.com/company/9911671/  \n",
       "55   https://m.liepin.com/company/9813547/  \n",
       "56  https://m.liepin.com/company/10091189/  \n",
       "57   https://m.liepin.com/company/9911671/  \n",
       "58   https://m.liepin.com/company/9911671/  \n",
       "59   https://m.liepin.com/company/9911671/  "
      ]
     },
     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# C-1   Single page\n",
    "url = \"https://m.liepin.com/zhaopin/?keyword=美女\"\n",
    "session = HTMLSession()\n",
    "r = session.get( url )\n",
    "\n",
    "# C-5\n",
    "# Hard to extract: '公司URL', '时间', '经验'\n",
    "\n",
    "# Select the container elements first, then target their descendants precisely\n",
    "主要元素 = r.html.xpath( \\\n",
    "    '//div[@class=\"job-card-wrap\"]//div[@class=\"job-card\"]')\n",
    "\n",
    "# Dictionary of XPaths: each key names a field to extract, each value is its XPath\n",
    "dict_xpaths={ \n",
    "    'text': {\n",
    "        '经验':      './/ul/li[time]/text()'\n",
    "    },\n",
    "    'text_content': {\n",
    "        '职称':    './/ul/li/a[contains(@class,\"job-name\")]/span[@class=\"name-text\"]', \n",
    "        '薪水':    './/ul/li/a[contains(@class,\"job-name\")]/following-sibling::span', \n",
    "        '公司地点':'.//ul/li/time/following-sibling::a',\n",
    "        '公司名称': './/ul/li/a[contains(@class,\"company-name\")]', \n",
    "        '时间':    './/ul/li/time', \n",
    "    },\n",
    "    'href': {\n",
    "        '链结':    './/ul/li/a[contains(@class,\"job-name\")]', \n",
    "        '公司URL': './/ul/li/a[contains(@class,\"company-name\")]', \n",
    "    }\n",
    "}\n",
    "\n",
    "def get_e_text_content(_xpath_):\n",
    "    # List comprehension: full text content of the first match in each job card\n",
    "    暂存结果 = [e.xpath(_xpath_)[0].lxml.text_content() for e in 主要元素]\n",
    "    return(暂存结果)\n",
    "\n",
    "def get_e_text(_xpath_):\n",
    "    # Nested list comprehension: strip and join the text nodes found in each job card\n",
    "    暂存结果 = [\"\".join([x.strip() for x in e.xpath(_xpath_)]) for e in 主要元素]\n",
    "    return(暂存结果)\n",
    "\n",
    "def get_e_href(_xpath_):\n",
    "    # List comprehension: first absolute link of each match, or \"\" if there is none\n",
    "    暂存结果 = [list(e.xpath(_xpath_, first=True).absolute_links)[0] \\\n",
    "               if len(e.xpath(_xpath_, first=True).absolute_links) >= 1  \\\n",
    "               else \"\" for e in 主要元素]\n",
    "    return(暂存结果)\n",
    "\n",
    "# Evaluate each XPath only within the main elements\n",
    "数据字典 = dict()\n",
    "\n",
    "数据字典 = {k:get_e_text_content(v) for k,v in dict_xpaths['text_content'].items()}\n",
    "数据字典.update({k:get_e_text(v) for k,v in dict_xpaths['text'].items()})\n",
    "数据字典.update({k:get_e_href(v) for k,v in dict_xpaths['href'].items()})\n",
    "\n",
    "# print([len(v) for k,v in 数据字典.items()])   # Check that every column has the same length\n",
    "\n",
    "数据 = pd.DataFrame(数据字典)\n",
    "数据.to_excel(\"20春_Web数据挖掘_week04_liepin.xlsx\", sheet_name=\"搜查结果\")\n",
    "数据 "
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
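    "#### Small pitfalls / style notes\n",
    "* Some lines of the code above end with a `\\` character. What does it mean?\n",
    "* Why use it? Is it there for the machine or for the human reader?\n",
    "* What happens if whitespace follows the trailing `\\`?\n",
    "\n",
    "----\n",
    "Answer: a trailing `\\` tells the interpreter that the statement is not finished and continues on the next line. It lets the person **writing** the code insert a line break legally, and its main purpose is to help the person **reading** the code, because well-placed line breaks make the logic easier to follow. Nothing may come after the backslash: if whitespace follows it, Python raises a `SyntaxError`."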
   ]
  },
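  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The behaviour of a trailing backslash can be checked with a short experiment. This is only an illustrative sketch: the helper name `compiles` and the three sample strings are assumptions made for this example and are not part of the scraping code above. It compiles three small snippets and reports whether each one is valid Python.\n",
    "\n",
    "```python\n",
    "def compiles(source):\n",
    "    # Return True if `source` is syntactically valid Python.\n",
    "    try:\n",
    "        compile(source, '<example>', 'exec')\n",
    "        return True\n",
    "    except SyntaxError:\n",
    "        return False\n",
    "\n",
    "# A backslash as the very last character joins this line with the next.\n",
    "good = 'total = 1 + \\\\\\n    2\\n'\n",
    "\n",
    "# A space after the backslash breaks the continuation.\n",
    "bad = 'total = 1 + \\\\ \\n    2\\n'\n",
    "\n",
    "# Inside parentheses no backslash is needed at all.\n",
    "implicit = 'total = (1 +\\n    2)\\n'\n",
    "\n",
    "print(compiles(good))      # True\n",
    "print(compiles(bad))       # False\n",
    "print(compiles(implicit))  # True\n",
    "```\n",
    "\n",
    "Because invisible trailing whitespace cannot break it, implicit continuation inside parentheses, brackets, or braces is often the more robust choice when a long expression has to be split."
   ]
  },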
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "\n",
    "-----\n",
    "\n",
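    "# This week's goal\n",
    "* [Liepin desktop site](https://www.liepin.com/zhaopin/)\n",
    "* The navigation bar at the top offers tabbed menus for company industry, city, and salary\n",
    "* Practice extracting the data with XPath"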
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Parsing HTML with XPath"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>edu</th>\n",
       "      <th>经验</th>\n",
       "      <th>薪水</th>\n",
       "      <th>时间</th>\n",
       "      <th>职称</th>\n",
       "      <th>公司地点</th>\n",
       "      <th>公司名称</th>\n",
       "      <th>链结</th>\n",
       "      <th>公司URL</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <td>0</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>2年以上</td>\n",
       "      <td>6-8k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>大客户主管、专员</td>\n",
       "      <td>武汉</td>\n",
       "      <td>武汉复星汉正街房地产开发有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927240251.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8549940/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>1</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>人力资源主管</td>\n",
       "      <td>青岛</td>\n",
       "      <td>青岛碧桂园产城发展有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927239763.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9654764/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>2</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>20-30k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>云计算销售经理</td>\n",
       "      <td>上海-虹桥</td>\n",
       "      <td>富通东方</td>\n",
       "      <td>https://www.liepin.com/job/1927239637.shtml</td>\n",
       "      <td>https://www.liepin.com/company/895509/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>3</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>品牌媒介主管</td>\n",
       "      <td>青岛-中韩</td>\n",
       "      <td>青岛碧桂园产城发展有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927239617.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9654764/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>4</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>15-20k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>口腔医生/儿牙医生</td>\n",
       "      <td>济南</td>\n",
       "      <td>济南可恩口腔医院有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927239607.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8267148/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>5</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>医疗美容整形材料销售</td>\n",
       "      <td>广州</td>\n",
       "      <td>意之美(广州)生物科技有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927224799.shtml</td>\n",
       "      <td>https://www.liepin.com/company/12001657/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>6</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>医疗美容整形材料销售</td>\n",
       "      <td>深圳</td>\n",
       "      <td>意之美(广州)生物科技有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927224787.shtml</td>\n",
       "      <td>https://www.liepin.com/company/12001657/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>7</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>5-10k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>渠道推广（陕南陕北）</td>\n",
       "      <td>西安-张家堡</td>\n",
       "      <td>陕西长宇酒业有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927215425.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9624258/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>8</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>4年以上</td>\n",
       "      <td>15-35k·13薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>产品经理</td>\n",
       "      <td>深圳-南油</td>\n",
       "      <td>上海隆长信息技术有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927208933.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9707913/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>9</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>高级培训经理（培训负责人）</td>\n",
       "      <td>晋中-榆次区</td>\n",
       "      <td>山西田森文创置业有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927178745.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10181635/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>10</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>33-60k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>投资合作总经理/总监（投合项目）</td>\n",
       "      <td>重庆-渝北区</td>\n",
       "      <td>重庆海成实业(集团)有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927175665.shtml</td>\n",
       "      <td>https://www.liepin.com/company/7902708/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>11</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>10-15k·13薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>财务部经理</td>\n",
       "      <td>郑州</td>\n",
       "      <td>郑州一建集团有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927175247.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10085847/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>12</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>20-30k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>安装造价专业经理</td>\n",
       "      <td>深圳-华强北</td>\n",
       "      <td>合景泰富集团莞惠城市公司</td>\n",
       "      <td>https://www.liepin.com/job/1927174785.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9569421/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>13</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>15-25k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>电商运营主管/经理</td>\n",
       "      <td>深圳</td>\n",
       "      <td>欧莎</td>\n",
       "      <td>https://www.liepin.com/job/1927170947.shtml</td>\n",
       "      <td>https://www.liepin.com/company/5653173/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>14</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>7-12k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>人才发展岗</td>\n",
       "      <td>广州-越秀区</td>\n",
       "      <td>广东链家房地产经纪有限公司广州分公司</td>\n",
       "      <td>https://www.liepin.com/job/1927168205.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9869391/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>15</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>工程机电专业副经理/主管</td>\n",
       "      <td>苏州-虎丘区</td>\n",
       "      <td>星河控股集团有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927167457.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10021555/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>16</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>2年以上</td>\n",
       "      <td>7-16k·14薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>高级亚马逊运营专员（储备组长）</td>\n",
       "      <td>广州-海珠区</td>\n",
       "      <td>广州瀚翼扬电子商务有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927163671.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8881741/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>17</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>2年以上</td>\n",
       "      <td>8-12k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>抖音直播运营</td>\n",
       "      <td>杭州</td>\n",
       "      <td>杭州姬曼服装有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927156957.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9143267/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>18</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>10-30k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>客户经理</td>\n",
       "      <td>郑州-金水区</td>\n",
       "      <td>黄河科技集团信息产业发展有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927150413.shtml</td>\n",
       "      <td>https://www.liepin.com/company/12181873/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>19</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>12-18k·13薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>安装造价师</td>\n",
       "      <td>南昌-红谷滩新区</td>\n",
       "      <td>九江市鑫达房地产开发有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927145775.shtml</td>\n",
       "      <td>https://www.liepin.com/company/12160995/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>20</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>2年以上</td>\n",
       "      <td>4-6k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>会计</td>\n",
       "      <td>广州</td>\n",
       "      <td>贵州省仁怀市华盛名酒业销售有限公司广东分公司</td>\n",
       "      <td>https://www.liepin.com/job/1927140501.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10226159/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>21</td>\n",
       "      <td>硕士及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>广东省农业融资担保有限责任公司风控部法务岗</td>\n",
       "      <td>广州-越秀区</td>\n",
       "      <td>粤财控股</td>\n",
       "      <td>https://www.liepin.com/job/1927127833.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8017350/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>22</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>广东省农业融资担保有限责任公司江门分公司业务岗</td>\n",
       "      <td>江门</td>\n",
       "      <td>粤财控股</td>\n",
       "      <td>https://www.liepin.com/job/1927127771.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8017350/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>23</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>广东省农业融资担保有限责任公司梅州分公司业务岗</td>\n",
       "      <td>梅州</td>\n",
       "      <td>粤财控股</td>\n",
       "      <td>https://www.liepin.com/job/1927127759.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8017350/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>24</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>经验不限</td>\n",
       "      <td>20-50k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>体育渠道搜寻专家</td>\n",
       "      <td></td>\n",
       "      <td>北京伟德体育文化有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927066603.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10243789/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>25</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>30-50k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>房地产营销总监</td>\n",
       "      <td>温州</td>\n",
       "      <td>温州亿联科技有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1927059783.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10108117/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>26</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>经验不限</td>\n",
       "      <td>7-15k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>直播运营（快手）</td>\n",
       "      <td>广州</td>\n",
       "      <td>衣酷集团</td>\n",
       "      <td>https://www.liepin.com/job/1926992523.shtml</td>\n",
       "      <td>https://www.liepin.com/company/7905352/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>27</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>15-30k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>伺服销售</td>\n",
       "      <td>深圳</td>\n",
       "      <td>广东奥普特科技股份有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926985729.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9397140/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>28</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>15-30k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>伺服销售</td>\n",
       "      <td>东莞-松岗</td>\n",
       "      <td>广东奥普特科技股份有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926985587.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9397140/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>29</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>15-22k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>高级财务经理(J11681)</td>\n",
       "      <td>江门</td>\n",
       "      <td>领益智造</td>\n",
       "      <td>https://www.liepin.com/job/1926942837.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9928281/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>30</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>3-5k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>出纳</td>\n",
       "      <td>广州</td>\n",
       "      <td>贵州省仁怀市华盛名酒业销售有限公司广东分公司</td>\n",
       "      <td>https://www.liepin.com/job/1926933821.shtml</td>\n",
       "      <td>https://www.liepin.com/company/10226159/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>31</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>2年以上</td>\n",
       "      <td>8-10k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>施工员兼技术员</td>\n",
       "      <td>上海-奉贤区</td>\n",
       "      <td>中浩建设股份有限公司上海分公司</td>\n",
       "      <td>https://www.liepin.com/job/1926740481.shtml</td>\n",
       "      <td>https://www.liepin.com/company/7867231/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>32</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>销售财务主管</td>\n",
       "      <td>上海-闵行区</td>\n",
       "      <td>黑龙江和美泰富食品有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926730179.shtml</td>\n",
       "      <td>https://www.liepin.com/company/12164285/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>33</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>5年以上</td>\n",
       "      <td>20-40k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>主设计师/设计总监</td>\n",
       "      <td>杭州-余杭区</td>\n",
       "      <td>杭州伯喜服饰有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926711403.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9739935/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>34</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>8年以上</td>\n",
       "      <td>20-40k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>渠道拓展总监</td>\n",
       "      <td>杭州-余杭区</td>\n",
       "      <td>杭州伯喜服饰有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926711285.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9739935/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>35</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>电商运营经理</td>\n",
       "      <td>上海-闵行区</td>\n",
       "      <td>黑龙江和美泰富食品有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926652769.shtml</td>\n",
       "      <td>https://www.liepin.com/company/12164285/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>36</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>8-10k·15薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>客户服务岗</td>\n",
       "      <td>广州-珠江新城</td>\n",
       "      <td>天安人寿保险股份有限公司广东分公司</td>\n",
       "      <td>https://www.liepin.com/job/1926501375.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9258426/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>37</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>8-15k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>运营助理/专员</td>\n",
       "      <td></td>\n",
       "      <td>金链汇信科技发展(北京)有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926170393.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9705621/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>38</td>\n",
       "      <td>本科及以上</td>\n",
       "      <td>7年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>(Sr.) Sales Account Manager</td>\n",
       "      <td>上海-浦东新区</td>\n",
       "      <td>MATLAB</td>\n",
       "      <td>https://www.liepin.com/job/1925634897.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8185522/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>39</td>\n",
       "      <td>大专及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>6-10k·12薪</td>\n",
       "      <td>2020年04月06日</td>\n",
       "      <td>技术员</td>\n",
       "      <td>杭州-上塘</td>\n",
       "      <td>杭州国泰门窗工程有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1925586149.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8073289/</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "      edu    经验          薪水           时间                           职称  \\\n",
       "0   大专及以上  2年以上    6-8k·12薪  2020年04月06日                     大客户主管、专员   \n",
       "1    统招本科  3年以上          面议  2020年04月06日                       人力资源主管   \n",
       "2   本科及以上  5年以上  20-30k·12薪  2020年04月06日                      云计算销售经理   \n",
       "3    统招本科  3年以上          面议  2020年04月06日                       品牌媒介主管   \n",
       "4    统招本科  5年以上  15-20k·12薪  2020年04月06日                    口腔医生/儿牙医生   \n",
       "5   大专及以上  1年以上  10-20k·12薪  2020年04月06日                   医疗美容整形材料销售   \n",
       "6   大专及以上  1年以上  10-20k·12薪  2020年04月06日                   医疗美容整形材料销售   \n",
       "7   大专及以上  3年以上   5-10k·12薪  2020年04月06日                   渠道推广（陕南陕北）   \n",
       "8   大专及以上  4年以上  15-35k·13薪  2020年04月06日                         产品经理   \n",
       "9   大专及以上  5年以上          面议  2020年04月06日                高级培训经理（培训负责人）   \n",
       "10  本科及以上  5年以上  33-60k·12薪  2020年04月06日             投资合作总经理/总监（投合项目）   \n",
       "11   统招本科  5年以上  10-15k·13薪  2020年04月06日                        财务部经理   \n",
       "12  本科及以上  5年以上  20-30k·12薪  2020年04月06日                     安装造价专业经理   \n",
       "13  大专及以上  3年以上  15-25k·12薪  2020年04月06日                    电商运营主管/经理   \n",
       "14   统招本科  3年以上   7-12k·12薪  2020年04月06日                        人才发展岗   \n",
       "15   统招本科  5年以上          面议  2020年04月06日                 工程机电专业副经理/主管   \n",
       "16  本科及以上  2年以上   7-16k·14薪  2020年04月06日              高级亚马逊运营专员（储备组长）   \n",
       "17  大专及以上  2年以上   8-12k·12薪  2020年04月06日                       抖音直播运营   \n",
       "18  大专及以上  3年以上  10-30k·12薪  2020年04月06日                         客户经理   \n",
       "19  大专及以上  3年以上  12-18k·13薪  2020年04月06日                        安装造价师   \n",
       "20  大专及以上  2年以上    4-6k·12薪  2020年04月06日                           会计   \n",
       "21  硕士及以上  5年以上          面议  2020年04月06日        广东省农业融资担保有限责任公司风控部法务岗   \n",
       "22  本科及以上  3年以上          面议  2020年04月06日      广东省农业融资担保有限责任公司江门分公司业务岗   \n",
       "23  本科及以上  3年以上          面议  2020年04月06日      广东省农业融资担保有限责任公司梅州分公司业务岗   \n",
       "24   学历不限  经验不限  20-50k·12薪  2020年04月06日                     体育渠道搜寻专家   \n",
       "25  大专及以上  5年以上  30-50k·12薪  2020年04月06日                      房地产营销总监   \n",
       "26   学历不限  经验不限   7-15k·12薪  2020年04月06日                     直播运营（快手）   \n",
       "27  大专及以上  3年以上  15-30k·12薪  2020年04月06日                         伺服销售   \n",
       "28  大专及以上  3年以上  15-30k·12薪  2020年04月06日                         伺服销售   \n",
       "29  本科及以上  5年以上  15-22k·12薪  2020年04月06日               高级财务经理(J11681)   \n",
       "30  大专及以上  1年以上    3-5k·12薪  2020年04月06日                           出纳   \n",
       "31  本科及以上  2年以上   8-10k·12薪  2020年04月06日                      施工员兼技术员   \n",
       "32  本科及以上  3年以上          面议  2020年04月06日                       销售财务主管   \n",
       "33  本科及以上  5年以上  20-40k·12薪  2020年04月06日                    主设计师/设计总监   \n",
       "34  本科及以上  8年以上  20-40k·12薪  2020年04月06日                       渠道拓展总监   \n",
       "35  本科及以上  3年以上          面议  2020年04月06日                       电商运营经理   \n",
       "36   统招本科  3年以上   8-10k·15薪  2020年04月06日                        客户服务岗   \n",
       "37  本科及以上  1年以上   8-15k·12薪  2020年04月06日                      运营助理/专员   \n",
       "38  本科及以上  7年以上          面议  2020年04月06日  (Sr.) Sales Account Manager   \n",
       "39  大专及以上  1年以上   6-10k·12薪  2020年04月06日                          技术员   \n",
       "\n",
       "        公司地点                    公司名称  \\\n",
       "0         武汉        武汉复星汉正街房地产开发有限公司   \n",
       "1         青岛           青岛碧桂园产城发展有限公司   \n",
       "2      上海-虹桥                    富通东方   \n",
       "3      青岛-中韩           青岛碧桂园产城发展有限公司   \n",
       "4         济南            济南可恩口腔医院有限公司   \n",
       "5         广州         意之美(广州)生物科技有限公司   \n",
       "6         深圳         意之美(广州)生物科技有限公司   \n",
       "7     西安-张家堡              陕西长宇酒业有限公司   \n",
       "8      深圳-南油            上海隆长信息技术有限公司   \n",
       "9     晋中-榆次区            山西田森文创置业有限公司   \n",
       "10    重庆-渝北区          重庆海成实业(集团)有限公司   \n",
       "11        郑州              郑州一建集团有限公司   \n",
       "12    深圳-华强北            合景泰富集团莞惠城市公司   \n",
       "13        深圳                      欧莎   \n",
       "14    广州-越秀区      广东链家房地产经纪有限公司广州分公司   \n",
       "15    苏州-虎丘区              星河控股集团有限公司   \n",
       "16    广州-海珠区           广州瀚翼扬电子商务有限公司   \n",
       "17        杭州              杭州姬曼服装有限公司   \n",
       "18    郑州-金水区        黄河科技集团信息产业发展有限公司   \n",
       "19  南昌-红谷滩新区          九江市鑫达房地产开发有限公司   \n",
       "20        广州  贵州省仁怀市华盛名酒业销售有限公司广东分公司   \n",
       "21    广州-越秀区                    粤财控股   \n",
       "22        江门                    粤财控股   \n",
       "23        梅州                    粤财控股   \n",
       "24                      北京伟德体育文化有限公司   \n",
       "25        温州              温州亿联科技有限公司   \n",
       "26        广州                    衣酷集团   \n",
       "27        深圳           广东奥普特科技股份有限公司   \n",
       "28     东莞-松岗           广东奥普特科技股份有限公司   \n",
       "29        江门                    领益智造   \n",
       "30        广州  贵州省仁怀市华盛名酒业销售有限公司广东分公司   \n",
       "31    上海-奉贤区         中浩建设股份有限公司上海分公司   \n",
       "32    上海-闵行区           黑龙江和美泰富食品有限公司   \n",
       "33    杭州-余杭区              杭州伯喜服饰有限公司   \n",
       "34    杭州-余杭区              杭州伯喜服饰有限公司   \n",
       "35    上海-闵行区           黑龙江和美泰富食品有限公司   \n",
       "36   广州-珠江新城       天安人寿保险股份有限公司广东分公司   \n",
       "37                  金链汇信科技发展(北京)有限公司   \n",
       "38   上海-浦东新区                  MATLAB   \n",
       "39     杭州-上塘            杭州国泰门窗工程有限公司   \n",
       "\n",
       "                                             链结  \\\n",
       "0   https://www.liepin.com/job/1927240251.shtml   \n",
       "1   https://www.liepin.com/job/1927239763.shtml   \n",
       "2   https://www.liepin.com/job/1927239637.shtml   \n",
       "3   https://www.liepin.com/job/1927239617.shtml   \n",
       "4   https://www.liepin.com/job/1927239607.shtml   \n",
       "5   https://www.liepin.com/job/1927224799.shtml   \n",
       "6   https://www.liepin.com/job/1927224787.shtml   \n",
       "7   https://www.liepin.com/job/1927215425.shtml   \n",
       "8   https://www.liepin.com/job/1927208933.shtml   \n",
       "9   https://www.liepin.com/job/1927178745.shtml   \n",
       "10  https://www.liepin.com/job/1927175665.shtml   \n",
       "11  https://www.liepin.com/job/1927175247.shtml   \n",
       "12  https://www.liepin.com/job/1927174785.shtml   \n",
       "13  https://www.liepin.com/job/1927170947.shtml   \n",
       "14  https://www.liepin.com/job/1927168205.shtml   \n",
       "15  https://www.liepin.com/job/1927167457.shtml   \n",
       "16  https://www.liepin.com/job/1927163671.shtml   \n",
       "17  https://www.liepin.com/job/1927156957.shtml   \n",
       "18  https://www.liepin.com/job/1927150413.shtml   \n",
       "19  https://www.liepin.com/job/1927145775.shtml   \n",
       "20  https://www.liepin.com/job/1927140501.shtml   \n",
       "21  https://www.liepin.com/job/1927127833.shtml   \n",
       "22  https://www.liepin.com/job/1927127771.shtml   \n",
       "23  https://www.liepin.com/job/1927127759.shtml   \n",
       "24  https://www.liepin.com/job/1927066603.shtml   \n",
       "25  https://www.liepin.com/job/1927059783.shtml   \n",
       "26  https://www.liepin.com/job/1926992523.shtml   \n",
       "27  https://www.liepin.com/job/1926985729.shtml   \n",
       "28  https://www.liepin.com/job/1926985587.shtml   \n",
       "29  https://www.liepin.com/job/1926942837.shtml   \n",
       "30  https://www.liepin.com/job/1926933821.shtml   \n",
       "31  https://www.liepin.com/job/1926740481.shtml   \n",
       "32  https://www.liepin.com/job/1926730179.shtml   \n",
       "33  https://www.liepin.com/job/1926711403.shtml   \n",
       "34  https://www.liepin.com/job/1926711285.shtml   \n",
       "35  https://www.liepin.com/job/1926652769.shtml   \n",
       "36  https://www.liepin.com/job/1926501375.shtml   \n",
       "37  https://www.liepin.com/job/1926170393.shtml   \n",
       "38  https://www.liepin.com/job/1925634897.shtml   \n",
       "39  https://www.liepin.com/job/1925586149.shtml   \n",
       "\n",
       "                                       公司URL  \n",
       "0    https://www.liepin.com/company/8549940/  \n",
       "1    https://www.liepin.com/company/9654764/  \n",
       "2     https://www.liepin.com/company/895509/  \n",
       "3    https://www.liepin.com/company/9654764/  \n",
       "4    https://www.liepin.com/company/8267148/  \n",
       "5   https://www.liepin.com/company/12001657/  \n",
       "6   https://www.liepin.com/company/12001657/  \n",
       "7    https://www.liepin.com/company/9624258/  \n",
       "8    https://www.liepin.com/company/9707913/  \n",
       "9   https://www.liepin.com/company/10181635/  \n",
       "10   https://www.liepin.com/company/7902708/  \n",
       "11  https://www.liepin.com/company/10085847/  \n",
       "12   https://www.liepin.com/company/9569421/  \n",
       "13   https://www.liepin.com/company/5653173/  \n",
       "14   https://www.liepin.com/company/9869391/  \n",
       "15  https://www.liepin.com/company/10021555/  \n",
       "16   https://www.liepin.com/company/8881741/  \n",
       "17   https://www.liepin.com/company/9143267/  \n",
       "18  https://www.liepin.com/company/12181873/  \n",
       "19  https://www.liepin.com/company/12160995/  \n",
       "20  https://www.liepin.com/company/10226159/  \n",
       "21   https://www.liepin.com/company/8017350/  \n",
       "22   https://www.liepin.com/company/8017350/  \n",
       "23   https://www.liepin.com/company/8017350/  \n",
       "24  https://www.liepin.com/company/10243789/  \n",
       "25  https://www.liepin.com/company/10108117/  \n",
       "26   https://www.liepin.com/company/7905352/  \n",
       "27   https://www.liepin.com/company/9397140/  \n",
       "28   https://www.liepin.com/company/9397140/  \n",
       "29   https://www.liepin.com/company/9928281/  \n",
       "30  https://www.liepin.com/company/10226159/  \n",
       "31   https://www.liepin.com/company/7867231/  \n",
       "32  https://www.liepin.com/company/12164285/  \n",
       "33   https://www.liepin.com/company/9739935/  \n",
       "34   https://www.liepin.com/company/9739935/  \n",
       "35  https://www.liepin.com/company/12164285/  \n",
       "36   https://www.liepin.com/company/9258426/  \n",
       "37   https://www.liepin.com/company/9705621/  \n",
       "38   https://www.liepin.com/company/8185522/  \n",
       "39   https://www.liepin.com/company/8073289/  "
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
    ],
   "source": [
    "# A-1   Single page\n",
    "url = \"https://www.liepin.com/zhaopin/?keyword=PRD\"\n",
    "session = HTMLSession()\n",
    "r = session.get( url )\n",
    "\n",
    "# Select the container elements first, then target their descendants precisely\n",
    "主要元素 = r.html.xpath( \\\n",
    "    '//ul[@class=\"sojob-list\"]/li')\n",
    "\n",
    "# Does each query return a list with one element, as expected?\n",
    "#print (主要元素[0].xpath('//div[contains(@class,\"sojob-item-main\")]'))\n",
    "#print (主要元素[0].xpath('//div[contains(@class,\"job-info\")]/h3/a'))\n",
    "#print (主要元素[3].xpath('//div[contains(@class,\"job-info\")]/p/a'))\n",
    "#print (主要元素[3].xpath('//div[contains(@class,\"job-info\")]/p/span[@class=\"text-warning\"]'))\n",
    "#print (主要元素[3].xpath('//div[contains(@class,\"job-info\")]/p/span[@class=\"edu\"]/following-sibling::span'))\n",
    "#print (主要元素[3].xpath('//div[contains(@class,\"job-info\")]/p/time/@title'))\n",
    "#print (主要元素[0].xpath('//div[contains(@class,\"sojob-item-main\")]//p[@class=\"company-name\"]/a'))\n",
    "\n",
    "# Dictionary of XPaths: each key names a field to extract, each value is its XPath\n",
    "dict_xpaths={\n",
    "    'text': {\n",
    "        'edu':      '//div[contains(@class,\"job-info\")]/p/span[@class=\"edu\"]',\n",
    "        '经验':      '//div[contains(@class,\"job-info\")]/p/span[@class=\"edu\"]/following-sibling::span',\n",
    "        '薪水':    '//div[contains(@class,\"job-info\")]/p/span[@class=\"text-warning\"]',\n",
    "        '时间':    '//div[contains(@class,\"job-info\")]/p/time/@title',\n",
    "        '职称':    '//div[contains(@class,\"job-info\")]/h3/a',\n",
    "        '公司地点': '//div[contains(@class,\"job-info\")]/p/a',\n",
    "        '公司名称': '//div[contains(@class,\"sojob-item-main\")]//p[@class=\"company-name\"]/a',\n",
    "    },\n",
    "    'text_content': {\n",
    "    },\n",
    "    'href': {\n",
    "        '链结':    '//div[contains(@class,\"job-info\")]/h3/a',\n",
    "        '公司URL': '//div[contains(@class,\"sojob-item-main\")]//p[@class=\"company-name\"]/a',\n",
    "    }\n",
    "}\n",
    "\n",
    "def get_e_text_content(_xpath_):\n",
    "    # List comprehension: full text content of the first match in each main element\n",
    "    暂存结果 = [e.xpath(_xpath_)[0].lxml.text_content() for e in 主要元素]\n",
    "    return 暂存结果\n",
    "\n",
    "def get_e_text(_xpath_):\n",
    "    # Attribute XPaths (e.g. @title) yield str; element XPaths yield Element objects\n",
    "    暂存结果 = [\"\".join([x.strip() if isinstance(x, str) else x.text.strip() for x in e.xpath(_xpath_)]) for e in 主要元素]\n",
    "    return 暂存结果\n",
    "\n",
    "def get_e_href(_xpath_):\n",
    "    # First absolute link of the first match, or \"\" when that match has no link\n",
    "    暂存结果 = [list(e.xpath(_xpath_, first=True).absolute_links)[0] \\\n",
    "               if len(e.xpath(_xpath_, first=True).absolute_links) >= 1  \\\n",
    "               else \"\" for e in 主要元素]\n",
    "    return 暂存结果\n",
    "\n",
    "# Run .xpath only within each main element\n",
    "数据字典 = dict()\n",
    "\n",
    "数据字典 = {k:get_e_text_content(v) for k,v in dict_xpaths['text_content'].items()}\n",
    "数据字典.update({k:get_e_text(v) for k,v in dict_xpaths['text'].items()})\n",
    "数据字典.update({k:get_e_href(v) for k,v in dict_xpaths['href'].items()})\n",
    "\n",
    "# print([len(v) for k,v in 数据字典.items()])   # Check that every column has the same number of items\n",
    "\n",
    "数据 = pd.DataFrame(数据字典)\n",
    "数据.to_excel(\"20春_Web数据挖掘_week03_liepin.xlsx\", sheet_name=\"搜查结果\")\n",
    "数据"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "[<Element 'div' class=('search-conditions',) data-selector='search-conditions'>]\n",
      "<Element 'div' class=('search-conditions',) data-selector='search-conditions'>\n",
      "[<Element 'dt' class=('search-title',)>, <Element 'dt' class=('search-title',)>, <Element 'dt' class=('search-title',)>, <Element 'dt' class=('search-title',)>, <Element 'dt' class=('search-title',)>]\n",
      "公司：\n",
      "行业：\n",
      "城市：\n",
      "薪资：\n",
      "更多：\n",
      "<Element 'dd' class=('comp-list',)>\n",
      "<Element 'dd' class=('short-dd', 'select-industry') data-param='industries'>\n",
      "<Element 'dd' data-param='city'>\n",
      "<Element 'dd' data-param='salary'>\n",
      "<Element 'dd' class=('dropdown', 'dropdown-time')>\n",
      "<Element 'dd' class=('dropdown', 'dropdown-jobkind')>\n",
      "<Element 'dd' class=('dropdown', 'dropdown-compscale')>\n",
      "<Element 'dd' class=('dropdown', 'dropdown-compkind')>\n"
     ]
    },
    {
     "data": {
      "text/plain": [
       "{'中国500强': '/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=155&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0',\n",
       " '2018互联网300强': '/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=182&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0',\n",
       " '制造业500强': '/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=186&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0',\n",
       " 'AI创新成长50强 ': '/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=189&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0',\n",
       " '独角兽': '/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=130&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0',\n",
       " '上市公司': '/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=156&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0'}"
      ]
     },
     "execution_count": 5,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# A-2 扩张 公司 ?  \n",
    "\n",
    "# 先取特定元素, 精准打击其子后辈\n",
    "主要元素 = r.html.xpath('//div[@data-selector=\"search-conditions\"]')\n",
    "# 预期是一个元素的列表？\n",
    "print (主要元素)\n",
    "print (主要元素[0])\n",
    "print (主要元素[0].xpath('//dt[@class=\"search-title\"]'))\n",
    "\n",
    "list_search_title = 主要元素[0].xpath('//dt[@class=\"search-title\"]')\n",
    "for x in list_search_title:\n",
    "    print (x.text)\n",
    "    \n",
    "list_search_dd = 主要元素[0].xpath('//dt[@class=\"search-title\"]/following-sibling::dd')\n",
    "for x in list_search_dd:\n",
    "    print (x)  \n",
    "    \n",
    "\n",
    "公司数据选择器链结 = r.html.xpath('//div[@data-selector=\"search-conditions\"]')[0] \\\n",
    "                    .xpath('//dt[@class=\"search-title\"]/following-sibling::dd')[0] \\\n",
    "                    .xpath('//div[contains(@class,\"hot-comp-tags\")]/a/@href')\n",
    "               \n",
    "公司数据选择器链结\n",
    "\n",
    "# 但我们需要知道这些选择器链结, 对映到什麽数据\n",
    "公司数据选择器链结 = r.html.xpath('//div[@data-selector=\"search-conditions\"]')[0] \\\n",
    "                    .xpath('//dt[@class=\"search-title\"]/following-sibling::dd')[0] \\\n",
    "                    .xpath('//div[contains(@class,\"hot-comp-tags\")]/a')\n",
    "公司数据选择器链结\n",
    "\n",
    "#[ x.xpath(\"a/@href\")[0] for x in 公司数据选择器链结]\n",
    "#[ x.xpath(\"a/text()\")[0] for x in 公司数据选择器链结]\n",
    "公司数据选择器链结 = { x.xpath(\"a/text()\")[0]:x.xpath(\"a/@href\")[0] for x in 公司数据选择器链结}\n",
    "公司数据选择器链结"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## 使用urllib3 解析 url \n",
    "上面的url应该触动不同的页面查询，但能不能轻松无误的拆分url并进行比较？\n",
    "\n",
    "### urllib模块功能介绍\n",
    "* urlparse \n",
    "返回的6个部分，分别是：scheme(机制)丶netloc(网络位置)丶path(路径)丶params(路径段参数)丶query(查询)丶fragment(片段)。\n",
    "* parse_qs\n",
    "返回query(查询)多个部分"
   ]
  },
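  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Before applying these two functions to the collected links, it may help to see them on a single URL. The block below is a minimal sketch: `sample_url` is a hypothetical, shortened stand-in for the real links, and `flat_query` is an illustrative name, not part of the original notebook.\n",
    "\n",
    "```python\n",
    "from urllib.parse import urlparse, parse_qs\n",
    "\n",
    "# Hypothetical relative URL, shortened from the links collected above\n",
    "sample_url = '/zhaopin/?keyword=PRD&compTag=155&d_curPage=0'\n",
    "\n",
    "# urlparse splits the URL into its six components\n",
    "parts = urlparse(sample_url)\n",
    "print(parts.path)\n",
    "print(parts.query)\n",
    "\n",
    "# parse_qs maps each parameter name to a list of values\n",
    "query = parse_qs(parts.query)\n",
    "print(query)\n",
    "\n",
    "# Keep only the first value of each parameter to get a flat dictionary\n",
    "flat_query = {k: v[0] for k, v in query.items()}\n",
    "print(flat_query)\n",
    "```\n",
    "\n",
    "Because the link is relative, `scheme` and `netloc` come back as empty strings, which matches the `ParseResult` values shown in B-1."
   ]
  },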
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[ParseResult(scheme='', netloc='', path='/zhaopin/', params='', query='init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=155&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0', fragment=''),\n",
       " ParseResult(scheme='', netloc='', path='/zhaopin/', params='', query='init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=182&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0', fragment=''),\n",
       " ParseResult(scheme='', netloc='', path='/zhaopin/', params='', query='init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=186&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0', fragment=''),\n",
       " ParseResult(scheme='', netloc='', path='/zhaopin/', params='', query='init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=189&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0', fragment=''),\n",
       " ParseResult(scheme='', netloc='', path='/zhaopin/', params='', query='init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=130&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0', fragment=''),\n",
       " ParseResult(scheme='', netloc='', path='/zhaopin/', params='', query='init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=PRD&compTag=156&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg%7EfA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0', fragment='')]"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# B-1 使用 urllib.parse 解析\n",
    "from urllib.parse import urlparse, parse_qs\n",
    "[ urlparse(x) for x in 公司数据选择器链结.values()]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "<class 'pandas.core.frame.DataFrame'>\n",
      "RangeIndex: 6 entries, 0 to 5\n",
      "Data columns (total 6 columns):\n",
      " #   Column    Non-Null Count  Dtype \n",
      "---  ------    --------------  ----- \n",
      " 0   scheme    6 non-null      object\n",
      " 1   netloc    6 non-null      object\n",
      " 2   path      6 non-null      object\n",
      " 3   params    6 non-null      object\n",
      " 4   query     6 non-null      object\n",
      " 5   fragment  6 non-null      object\n",
      "dtypes: object(6)\n",
      "memory usage: 416.0+ bytes\n",
      "scheme      1\n",
      "netloc      1\n",
      "path        1\n",
      "params      1\n",
      "query       6\n",
      "fragment    1\n",
      "dtype: int64\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>scheme</th>\n",
       "      <th>netloc</th>\n",
       "      <th>path</th>\n",
       "      <th>params</th>\n",
       "      <th>query</th>\n",
       "      <th>fragment</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td></td>\n",
       "      <td></td>\n",
       "      <td>/zhaopin/</td>\n",
       "      <td></td>\n",
       "      <td>init=-1&amp;headckid=8866e146c48f66fa&amp;flushckid=1&amp;...</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "  scheme netloc       path params  \\\n",
       "0                /zhaopin/          \n",
       "\n",
       "                                               query fragment  \n",
       "0  init=-1&headckid=8866e146c48f66fa&flushckid=1&...           "
      ]
     },
     "execution_count": 13,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# B-2 使用 pd.DataFrame进行 unuinque()相异值计量比对 \n",
    "import pandas as pd\n",
    "df = pd.DataFrame([ urlparse(x) for x in 公司数据选择器链结.values()])\n",
    "df.info()\n",
    "print(df.nunique())\n",
    "df.head(1)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "init             1\n",
      "headckid         1\n",
      "flushckid        1\n",
      "fromSearchBtn    1\n",
      "keyword          1\n",
      "compTag          6\n",
      "ckid             1\n",
      "siTag            1\n",
      "d_sfrom          1\n",
      "d_ckId           1\n",
      "d_curPage        1\n",
      "d_pageSize       1\n",
      "d_headId         1\n",
      "dtype: int64\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>keyword</th>\n",
       "      <th>compTag</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>PRD</td>\n",
       "      <td>155</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>PRD</td>\n",
       "      <td>182</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>PRD</td>\n",
       "      <td>186</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>PRD</td>\n",
       "      <td>189</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>PRD</td>\n",
       "      <td>130</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>PRD</td>\n",
       "      <td>156</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "  keyword compTag\n",
       "0     PRD     155\n",
       "1     PRD     182\n",
       "2     PRD     186\n",
       "3     PRD     189\n",
       "4     PRD     130\n",
       "5     PRD     156"
      ]
     },
     "execution_count": 14,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# B-3 针对query 再解析之 \n",
    "#df_qs = pd.DataFrame([ parse_qs(x) for x in df['query'] ])\n",
    "df_qs = pd.DataFrame([{k:v[0] for k,v in parse_qs(x).items()} for x in df['query'] ])\n",
    "print (df_qs.nunique())\n",
    "df_qs.head()\n",
    "df_qs[['keyword','compTag']]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 18,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "[{'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '155', 'ckid': '8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}, {'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '182', 'ckid': '8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}, {'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '186', 'ckid': '8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}, {'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '189', 'ckid': '8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}, {'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '130', 'ckid': '8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}, {'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '156', 'ckid': 
'8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}] {'init': '-1', 'headckid': '8866e146c48f66fa', 'flushckid': '1', 'fromSearchBtn': '2', 'keyword': 'PRD', 'compTag': '156', 'ckid': '8866e146c48f66fa', 'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw', 'd_sfrom': 'search_unknown', 'd_ckId': 'a480a26be5e22fead4318a487a8710a0', 'd_curPage': '0', 'd_pageSize': '40', 'd_headId': 'a480a26be5e22fead4318a487a8710a0'}\n"
     ]
    },
    {
     "data": {
      "text/plain": [
       "\"python 原代碼以下不處理\\ndf_qs = pd.DataFrame(list_query)\\nprint (df_qs.nunique())\\ndf_qs.head()\\ndf_qs[['keyword','compTag']]\\n\""
      ]
     },
     "execution_count": 18,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# B-3-X 对 B-3 代码的字典/列表推导的分拆说明\n",
    "\n",
    "# ----------------------------------------------\n",
    "# 列表暂存 = [{k:v[0] for k,v in parse_qs(q).items()} for q in df['query'] ]\n",
    "# 以下3行代碼相當於上面推導1行\n",
    "列表暂存 = [] # list()\n",
    "for x in df['query']: \n",
    "    列表暂存.append({k:v[0] for k,v in parse_qs(x).items()} )\n",
    "    \n",
    "# ----------------------------------------------\n",
    "# 字典暂存 = {k:v[0] for k,v in parse_qs(x).items()}\n",
    "# 以下3行代碼相當於上面推導1行\n",
    "字典暂存 = dict()\n",
    "for k,v in parse_qs(x).items():           # for 键,值 in 字典.items():\n",
    "    字典暂存.update({k:v[0]})\n",
    "    \n",
    "\n",
    "# ----------------------------------------------\n",
    "print (列表暂存, 字典暂存) \n",
    "\n",
    "'''python 原代碼以下不處理\n",
    "df_qs = pd.DataFrame(list_query)\n",
    "print (df_qs.nunique())\n",
    "df_qs.head()\n",
    "df_qs[['keyword','compTag']]\n",
    "'''"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 21,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[{'init': '-1',\n",
       "  'headckid': '8866e146c48f66fa',\n",
       "  'flushckid': '1',\n",
       "  'fromSearchBtn': '2',\n",
       "  'keyword': 'PRD',\n",
       "  'compTag': '155',\n",
       "  'ckid': '8866e146c48f66fa',\n",
       "  'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw',\n",
       "  'd_sfrom': 'search_unknown',\n",
       "  'd_ckId': 'a480a26be5e22fead4318a487a8710a0',\n",
       "  'd_curPage': '0',\n",
       "  'd_pageSize': '40',\n",
       "  'd_headId': 'a480a26be5e22fead4318a487a8710a0'},\n",
       " {'init': '-1',\n",
       "  'headckid': '8866e146c48f66fa',\n",
       "  'flushckid': '1',\n",
       "  'fromSearchBtn': '2',\n",
       "  'keyword': 'PRD',\n",
       "  'compTag': '182',\n",
       "  'ckid': '8866e146c48f66fa',\n",
       "  'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw',\n",
       "  'd_sfrom': 'search_unknown',\n",
       "  'd_ckId': 'a480a26be5e22fead4318a487a8710a0',\n",
       "  'd_curPage': '0',\n",
       "  'd_pageSize': '40',\n",
       "  'd_headId': 'a480a26be5e22fead4318a487a8710a0'},\n",
       " {'init': '-1',\n",
       "  'headckid': '8866e146c48f66fa',\n",
       "  'flushckid': '1',\n",
       "  'fromSearchBtn': '2',\n",
       "  'keyword': 'PRD',\n",
       "  'compTag': '186',\n",
       "  'ckid': '8866e146c48f66fa',\n",
       "  'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw',\n",
       "  'd_sfrom': 'search_unknown',\n",
       "  'd_ckId': 'a480a26be5e22fead4318a487a8710a0',\n",
       "  'd_curPage': '0',\n",
       "  'd_pageSize': '40',\n",
       "  'd_headId': 'a480a26be5e22fead4318a487a8710a0'},\n",
       " {'init': '-1',\n",
       "  'headckid': '8866e146c48f66fa',\n",
       "  'flushckid': '1',\n",
       "  'fromSearchBtn': '2',\n",
       "  'keyword': 'PRD',\n",
       "  'compTag': '189',\n",
       "  'ckid': '8866e146c48f66fa',\n",
       "  'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw',\n",
       "  'd_sfrom': 'search_unknown',\n",
       "  'd_ckId': 'a480a26be5e22fead4318a487a8710a0',\n",
       "  'd_curPage': '0',\n",
       "  'd_pageSize': '40',\n",
       "  'd_headId': 'a480a26be5e22fead4318a487a8710a0'},\n",
       " {'init': '-1',\n",
       "  'headckid': '8866e146c48f66fa',\n",
       "  'flushckid': '1',\n",
       "  'fromSearchBtn': '2',\n",
       "  'keyword': 'PRD',\n",
       "  'compTag': '130',\n",
       "  'ckid': '8866e146c48f66fa',\n",
       "  'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw',\n",
       "  'd_sfrom': 'search_unknown',\n",
       "  'd_ckId': 'a480a26be5e22fead4318a487a8710a0',\n",
       "  'd_curPage': '0',\n",
       "  'd_pageSize': '40',\n",
       "  'd_headId': 'a480a26be5e22fead4318a487a8710a0'},\n",
       " {'init': '-1',\n",
       "  'headckid': '8866e146c48f66fa',\n",
       "  'flushckid': '1',\n",
       "  'fromSearchBtn': '2',\n",
       "  'keyword': 'PRD',\n",
       "  'compTag': '156',\n",
       "  'ckid': '8866e146c48f66fa',\n",
       "  'siTag': '1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw',\n",
       "  'd_sfrom': 'search_unknown',\n",
       "  'd_ckId': 'a480a26be5e22fead4318a487a8710a0',\n",
       "  'd_curPage': '0',\n",
       "  'd_pageSize': '40',\n",
       "  'd_headId': 'a480a26be5e22fead4318a487a8710a0'}]"
      ]
     },
     "execution_count": 21,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# B-3-XX 整合后\n",
    "\n",
    "列表暫存 = [] # list()\n",
    "for q in df['query']: \n",
    "    字典暫存 = dict()\n",
    "    for k,v in parse_qs(q).items(): # for 鍵,值 in 字典.items():\n",
    "        字典暫存.update({k:v[0]})\n",
    "    列表暫存.append(字典暫存)\n",
    "列表暫存\n",
    "# ----------------------------------------------\n",
    "# 比較\n",
    "# 列表暫存 = [{k:v[0] for k,v in parse_qs(q).items()} for q in df['query'] ]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### 小结\n",
    "* comTag 是不同的公司选择器, 数值不样, 对映到不同类型的公司\n",
    "* keyword 是搜查关键字"
   ]
  },
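  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The same conclusion can be reached without pandas. The sketch below assumes three hypothetical, shortened links (`links`) and uses plain sets to separate the parameters that vary from those that stay constant; the names `seen`, `varying`, and `constant` are illustrative only.\n",
    "\n",
    "```python\n",
    "from urllib.parse import urlparse, parse_qs\n",
    "\n",
    "# Hypothetical, shortened versions of the collected links\n",
    "links = [\n",
    "    '/zhaopin/?keyword=PRD&compTag=155&d_pageSize=40',\n",
    "    '/zhaopin/?keyword=PRD&compTag=182&d_pageSize=40',\n",
    "    '/zhaopin/?keyword=PRD&compTag=186&d_pageSize=40',\n",
    "]\n",
    "\n",
    "queries = [parse_qs(urlparse(link).query) for link in links]\n",
    "\n",
    "# Collect the set of distinct values seen for every parameter name\n",
    "seen = {}\n",
    "for q in queries:\n",
    "    for name, values in q.items():\n",
    "        seen.setdefault(name, set()).add(tuple(values))\n",
    "\n",
    "# A parameter varies if more than one distinct value was seen for it\n",
    "varying = sorted(name for name, vals in seen.items() if len(vals) > 1)\n",
    "constant = sorted(name for name, vals in seen.items() if len(vals) == 1)\n",
    "print(varying)\n",
    "print(constant)\n",
    "```\n",
    "\n",
    "For links shaped like these, only `compTag` ends up in `varying`, which is the plain-Python counterpart of the `nunique()` check in B-3."
   ]
  },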
  {
   "cell_type": "code",
   "execution_count": 22,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['PRD'], 'compTag': ['155'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': ['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}\n",
      "{'中国500强': '155', '2018互联网300强': '182', '制造业500强': '186', 'AI创新成长50强 ': '189', '独角兽': '130', '上市公司': '156'}\n"
     ]
    }
   ],
   "source": [
    "# B-4 建构 参数模板 及 字典_compTag\n",
    "def parse_url_qs_for_compTag (url):\n",
    "    six_parts = urlparse(url) \n",
    "    out = parse_qs(six_parts.query)\n",
    "    return (out)\n",
    "\n",
    "# parse_url_qs_for_compTag(list(公司数据选择器链结.values())[0])['compTag']\n",
    "参数模板 = parse_url_qs_for_compTag(list(公司数据选择器链结.values())[0])\n",
    "print(参数模板)\n",
    "# [ parse_url_qs_for_compTag(x)['compTag'] for x in 公司数据选择器链结.values()]\n",
    "[ parse_url_qs_for_compTag(x)['compTag'][0] for x in 公司数据选择器链结.values()]\n",
    "\n",
    "字典_compTag = { k:parse_url_qs_for_compTag(v)['compTag'][0] for k,v in 公司数据选择器链结.items()}\n",
    "print (字典_compTag)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 23,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'中国500强': {'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['用户体验'], 'compTag': ['155'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': ['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}, '2018互联网300强': {'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['用户体验'], 'compTag': ['182'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': ['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}, '制造业500强': {'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['用户体验'], 'compTag': ['186'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': ['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}, 'AI创新成长50强 ': {'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['用户体验'], 'compTag': ['189'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': ['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}, '独角兽': {'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['用户体验'], 'compTag': ['130'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': 
['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}, '上市公司': {'init': ['-1'], 'headckid': ['8866e146c48f66fa'], 'flushckid': ['1'], 'fromSearchBtn': ['2'], 'keyword': ['用户体验'], 'compTag': ['156'], 'ckid': ['8866e146c48f66fa'], 'siTag': ['1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw'], 'd_sfrom': ['search_unknown'], 'd_ckId': ['a480a26be5e22fead4318a487a8710a0'], 'd_curPage': ['0'], 'd_pageSize': ['40'], 'd_headId': ['a480a26be5e22fead4318a487a8710a0']}}\n"
     ]
    }
   ],
   "source": [
    "# B-5 建构 参数模板  \n",
    "def 参数模板生成(compTag , keyword ):\n",
    "    参数 = 参数模板.copy()\n",
    "    参数['compTag'] = compTag\n",
    "    参数['keyword'] = keyword\n",
    "    return (参数)\n",
    "\n",
    "参数_compTag_用户体验 = { k:参数模板生成(compTag = [v], keyword = ['用户体验']) for k,v in 字典_compTag.items()}\n",
    "print(参数_compTag_用户体验)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## requests 生成"
   ]
  },
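  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "A base URL plus a parameter dictionary is enough to produce a series of new URLs. The sketch below does this offline with `urllib.parse.urlencode`; `template` is a hypothetical, shortened version of the parameter template, and `build_url` is an illustrative helper, not a function from the original notebook. Passing `params=` to `session.get` encodes the query in a broadly similar way.\n",
    "\n",
    "```python\n",
    "from urllib.parse import urlencode, urlsplit, parse_qs\n",
    "\n",
    "base_url = 'https://www.liepin.com/zhaopin/'\n",
    "# Hypothetical, shortened template in the list-valued shape that parse_qs returns\n",
    "template = {'keyword': ['PRD'], 'compTag': ['155'], 'd_curPage': ['0']}\n",
    "\n",
    "def build_url(base, params, **overrides):\n",
    "    # Shallow copy so the template itself is left untouched\n",
    "    merged = dict(params)\n",
    "    merged.update({k: [str(v)] for k, v in overrides.items()})\n",
    "    # doseq=True expands each list into key=value pairs\n",
    "    return base + '?' + urlencode(merged, doseq=True)\n",
    "\n",
    "urls = [build_url(base_url, template, compTag=tag) for tag in ('155', '182', '186')]\n",
    "for u in urls:\n",
    "    print(u)\n",
    "\n",
    "# Round trip: the generated URL parses back to the overridden value\n",
    "print(parse_qs(urlsplit(urls[2]).query)['compTag'])\n",
    "```\n",
    "\n",
    "Copying the template before updating it keeps every generated URL independent of the others."
   ]
  },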
  {
   "cell_type": "code",
   "execution_count": 24,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'https://www.liepin.com/zhaopin/?init=-1&headckid=8866e146c48f66fa&flushckid=1&fromSearchBtn=2&keyword=%E7%94%A8%E6%88%B7%E4%BD%93%E9%AA%8C&compTag=155&ckid=8866e146c48f66fa&siTag=1B2M2Y8AsgTpgAmY7PhCfg~fA9rXquZc5IkJpXC-Ycixw&d_sfrom=search_unknown&d_ckId=a480a26be5e22fead4318a487a8710a0&d_curPage=0&d_pageSize=40&d_headId=a480a26be5e22fead4318a487a8710a0'"
      ]
     },
     "execution_count": 24,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# C-1   多个页面准备测试1 中国500强\n",
    "url = \"https://www.liepin.com/zhaopin/\"\n",
    "session = HTMLSession()\n",
    "payload = 参数_compTag_用户体验['中国500强']\n",
    "r = session.get( url, params = payload)\n",
    "r.url"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 25,
   "metadata": {},
   "outputs": [],
   "source": [
    "# C-2  简化 A-1   单一页面爬+解析\n",
    "session = HTMLSession()\n",
    "\n",
    "def requests_liepin( url, params):\n",
    "    r = session.get( url , params = payload)\n",
    "\n",
    "    # 先取特定元素, 精准打击其子后辈\n",
    "    主要元素 = r.html.xpath( '//ul[@class=\"sojob-list\"]/li')\n",
    "\n",
    "    # 作为xpath字典，键为我要抓的牛肉名称，值为xpath\n",
    "    dict_xpaths={ \n",
    "        'text': {\n",
    "            'edu':      '//div[contains(@class,\"job-info\")]/p/span[@class=\"edu\"]',\n",
    "            '经验':      '//div[contains(@class,\"job-info\")]/p/span[@class=\"edu\"]/following-sibling::span',\n",
    "            '薪水':    '//div[contains(@class,\"job-info\")]/p/span[@class=\"text-warning\"]', \n",
    "            '时间':    '//div[contains(@class,\"job-info\")]/p/time/@title', \n",
    "            '职称':    '//div[contains(@class,\"job-info\")]/h3/a', \n",
    "            '公司地点': '//div[contains(@class,\"job-info\")]/p/a',\n",
    "            '公司名称': '//div[contains(@class,\"sojob-item-main\")]//p[@class=\"company-name\"]/a', \n",
    "        },\n",
    "        'text_content': {\n",
    "        },\n",
    "        'href': {\n",
    "            '链结':    '//div[contains(@class,\"job-info\")]/h3/a', \n",
    "            '公司URL': '//div[contains(@class,\"sojob-item-main\")]//p[@class=\"company-name\"]/a', \n",
    "        }\n",
    "    }\n",
    "\n",
    "    def get_e_text_content(_xpath_):\n",
    "        # 高级列表推导\n",
    "        暂存结果 = [e.xpath(_xpath_)[0].lxml.text_content() for e in 主要元素]\n",
    "        return(暂存结果)\n",
    "\n",
    "    def get_e_text(_xpath_):\n",
    "        # 高级列表推导\n",
    "        暂存结果 = [\"\".join([x.strip() if type(x) is str else x.text.strip() for x in e.xpath(_xpath_)]) for e in 主要元素]\n",
    "        return(暂存结果)\n",
    "\n",
    "    def get_e_href(_xpath_):\n",
    "        # 高级列表推导\n",
    "        暂存结果 = [list(e.xpath(_xpath_, first=True).absolute_links)[0] \\\n",
    "                   if len(e.xpath(_xpath_, first=True).absolute_links) >= 1  \\\n",
    "                   else \"\" for e in 主要元素]\n",
    "        return(暂存结果)\n",
    "\n",
    "    # 只对主要元素下进行.xpath取值\n",
    "    数据字典 = dict()\n",
    "\n",
    "    数据字典 = {k:get_e_text_content(v) for k,v in dict_xpaths['text_content'].items()}\n",
    "    数据字典.update({k:get_e_text(v) for k,v in dict_xpaths['text'].items()})\n",
    "    数据字典.update({k:get_e_href(v) for k,v in dict_xpaths['href'].items()})\n",
    "\n",
    "    数据 = pd.DataFrame(数据字典)\n",
    "    #数据.to_excel(\"20春_Web数据挖掘_week03_liepin.xlsx\", sheet_name=\"搜查结果\")\n",
    "    return (数据)\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 26,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>edu</th>\n",
       "      <th>经验</th>\n",
       "      <th>薪水</th>\n",
       "      <th>时间</th>\n",
       "      <th>职称</th>\n",
       "      <th>公司地点</th>\n",
       "      <th>公司名称</th>\n",
       "      <th>链结</th>\n",
       "      <th>公司URL</th>\n",
       "      <th>热门公司类型</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年03月31日</td>\n",
       "      <td>大客户销售经理-北京-网易严选</td>\n",
       "      <td>北京-五道口</td>\n",
       "      <td>网易集团</td>\n",
       "      <td>https://www.liepin.com/job/1926756751.shtml</td>\n",
       "      <td>https://www.liepin.com/company/5964833/</td>\n",
       "      <td>中国500强</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>统招本科</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年03月30日</td>\n",
       "      <td>阿里云智能事业群-数据技术专家(金融行业)-北京/杭州</td>\n",
       "      <td>杭州</td>\n",
       "      <td>阿里巴巴</td>\n",
       "      <td>https://www.liepin.com/job/1927063431.shtml</td>\n",
       "      <td>https://www.liepin.com/company/1072424/</td>\n",
       "      <td>中国500强</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年03月27日</td>\n",
       "      <td>钉钉(Dingtalk)-搜索中心-Java开发技术专家</td>\n",
       "      <td>杭州</td>\n",
       "      <td>阿里巴巴</td>\n",
       "      <td>https://www.liepin.com/job/1926996383.shtml</td>\n",
       "      <td>https://www.liepin.com/company/1072424/</td>\n",
       "      <td>中国500强</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>大专及以上</td>\n",
       "      <td>2年以上</td>\n",
       "      <td>6-8k·13薪</td>\n",
       "      <td>2020年03月25日</td>\n",
       "      <td>员工关系专员</td>\n",
       "      <td>廊坊-广阳区</td>\n",
       "      <td>中国国际技术智力合作有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926938099.shtml</td>\n",
       "      <td>https://www.liepin.com/company/1233751/</td>\n",
       "      <td>中国500强</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>25-50k·12薪</td>\n",
       "      <td>2020年03月24日</td>\n",
       "      <td>钉钉(DingTalk)-安全运营专家-安全产品及中心</td>\n",
       "      <td>杭州</td>\n",
       "      <td>阿里巴巴</td>\n",
       "      <td>https://www.liepin.com/job/1926923363.shtml</td>\n",
       "      <td>https://www.liepin.com/company/1072424/</td>\n",
       "      <td>中国500强</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>5-8k·12薪</td>\n",
       "      <td>2020年03月26日</td>\n",
       "      <td>片区人力资源主任/专员</td>\n",
       "      <td>中山</td>\n",
       "      <td>碧桂园智慧物业服务集团股份有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1919360705.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8694860/</td>\n",
       "      <td>上市公司</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>6年以上</td>\n",
       "      <td>15-20k·13薪</td>\n",
       "      <td>2020年03月26日</td>\n",
       "      <td>法务经理/主任</td>\n",
       "      <td>深圳</td>\n",
       "      <td>中国南玻集团股份有限公司</td>\n",
       "      <td>https://www.liepin.com/job/1926955487.shtml</td>\n",
       "      <td>https://www.liepin.com/company/9091167/</td>\n",
       "      <td>上市公司</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>37</th>\n",
       "      <td>统招本科</td>\n",
       "      <td>10年以上</td>\n",
       "      <td>面议</td>\n",
       "      <td>2020年03月26日</td>\n",
       "      <td>CHO/HRD</td>\n",
       "      <td>上海</td>\n",
       "      <td>银科控股</td>\n",
       "      <td>https://www.liepin.com/job/1915800458.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8582797/</td>\n",
       "      <td>上市公司</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>38</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>3年以上</td>\n",
       "      <td>20-30k·12薪</td>\n",
       "      <td>2020年03月25日</td>\n",
       "      <td>SAP 运维顾问</td>\n",
       "      <td>北京</td>\n",
       "      <td>科兴</td>\n",
       "      <td>https://www.liepin.com/job/1926949105.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8593199/</td>\n",
       "      <td>上市公司</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>39</th>\n",
       "      <td>本科及以上</td>\n",
       "      <td>1年以上</td>\n",
       "      <td>10-20k·12薪</td>\n",
       "      <td>2020年03月25日</td>\n",
       "      <td>新闻短视频运营 (MJ000067)</td>\n",
       "      <td>北京</td>\n",
       "      <td>凤凰新媒体</td>\n",
       "      <td>https://www.liepin.com/job/1925965933.shtml</td>\n",
       "      <td>https://www.liepin.com/company/8139695/</td>\n",
       "      <td>上市公司</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>240 rows × 10 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "      edu     经验          薪水           时间                            职称  \\\n",
       "0   本科及以上   3年以上          面议  2020年03月31日               大客户销售经理-北京-网易严选   \n",
       "1    统招本科   3年以上          面议  2020年03月30日   阿里云智能事业群-数据技术专家(金融行业)-北京/杭州   \n",
       "2   本科及以上   3年以上          面议  2020年03月27日  钉钉(Dingtalk)-搜索中心-Java开发技术专家   \n",
       "3   大专及以上   2年以上    6-8k·13薪  2020年03月25日                        员工关系专员   \n",
       "4   本科及以上   3年以上  25-50k·12薪  2020年03月24日   钉钉(DingTalk)-安全运营专家-安全产品及中心   \n",
       "..    ...    ...         ...          ...                           ...   \n",
       "35  本科及以上   1年以上    5-8k·12薪  2020年03月26日                   片区人力资源主任/专员   \n",
       "36  本科及以上   6年以上  15-20k·13薪  2020年03月26日                       法务经理/主任   \n",
       "37   统招本科  10年以上          面议  2020年03月26日                       CHO/HRD   \n",
       "38  本科及以上   3年以上  20-30k·12薪  2020年03月25日                      SAP 运维顾问   \n",
       "39  本科及以上   1年以上  10-20k·12薪  2020年03月25日            新闻短视频运营 (MJ000067)   \n",
       "\n",
       "      公司地点               公司名称                                           链结  \\\n",
       "0   北京-五道口               网易集团  https://www.liepin.com/job/1926756751.shtml   \n",
       "1       杭州               阿里巴巴  https://www.liepin.com/job/1927063431.shtml   \n",
       "2       杭州               阿里巴巴  https://www.liepin.com/job/1926996383.shtml   \n",
       "3   廊坊-广阳区     中国国际技术智力合作有限公司  https://www.liepin.com/job/1926938099.shtml   \n",
       "4       杭州               阿里巴巴  https://www.liepin.com/job/1926923363.shtml   \n",
       "..     ...                ...                                          ...   \n",
       "35      中山  碧桂园智慧物业服务集团股份有限公司  https://www.liepin.com/job/1919360705.shtml   \n",
       "36      深圳       中国南玻集团股份有限公司  https://www.liepin.com/job/1926955487.shtml   \n",
       "37      上海               银科控股  https://www.liepin.com/job/1915800458.shtml   \n",
       "38      北京                 科兴  https://www.liepin.com/job/1926949105.shtml   \n",
       "39      北京              凤凰新媒体  https://www.liepin.com/job/1925965933.shtml   \n",
       "\n",
       "                                      公司URL  热门公司类型  \n",
       "0   https://www.liepin.com/company/5964833/  中国500强  \n",
       "1   https://www.liepin.com/company/1072424/  中国500强  \n",
       "2   https://www.liepin.com/company/1072424/  中国500强  \n",
       "3   https://www.liepin.com/company/1233751/  中国500强  \n",
       "4   https://www.liepin.com/company/1072424/  中国500强  \n",
       "..                                      ...     ...  \n",
       "35  https://www.liepin.com/company/8694860/    上市公司  \n",
       "36  https://www.liepin.com/company/9091167/    上市公司  \n",
       "37  https://www.liepin.com/company/8582797/    上市公司  \n",
       "38  https://www.liepin.com/company/8593199/    上市公司  \n",
       "39  https://www.liepin.com/company/8139695/    上市公司  \n",
       "\n",
       "[240 rows x 10 columns]"
      ]
     },
     "execution_count": 26,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
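    "# C-3   Multiple pages\n",
    "url = \"https://www.liepin.com/zhaopin/\"\n",
    "\n",
    "# Request one results page per parameter set and tag each table with its key\n",
    "list_df = list()\n",
    "for k, payload in 参数_compTag_用户体验.items():\n",
    "    df = requests_liepin(url, params=payload)\n",
    "    df = df.assign(热门公司类型=k)\n",
    "    list_df.append(df)\n",
    "\n",
    "# Stack the per-type tables; the original row indices are kept, not renumbered\n",
    "df_all = pd.concat(list_df)\n",
    "df_all"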
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 27,
   "metadata": {},
   "outputs": [],
   "source": [
    "# C-4   Output: save the combined table to Excel\n",
    "df_all.to_excel(\"20春_Web数据挖掘_week03_liepin_各热门公司类型.xlsx\", sheet_name=\"搜查结果\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 28,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "edu         6\n",
      "经验         10\n",
      "薪水         78\n",
      "时间         31\n",
      "职称        182\n",
      "公司地点       82\n",
      "公司名称       59\n",
      "链结        196\n",
      "公司URL      59\n",
      "热门公司类型      6\n",
      "dtype: int64\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th>职称</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>公司名称</th>\n",
       "      <th>edu</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th rowspan=\"2\" valign=\"top\">华为</th>\n",
       "      <th>统招本科</th>\n",
       "      <td>30</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>本科及以上</th>\n",
       "      <td>18</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>科大讯飞</th>\n",
       "      <th>本科及以上</th>\n",
       "      <td>12</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>海尔智家</th>\n",
       "      <th>本科及以上</th>\n",
       "      <td>10</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>上海擎创信息技术有限公司</th>\n",
       "      <th>本科及以上</th>\n",
       "      <td>9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>明略科技集团</th>\n",
       "      <th>统招本科</th>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>朴新教育</th>\n",
       "      <th>统招本科</th>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>柳工机械</th>\n",
       "      <th>统招本科</th>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>江南布衣</th>\n",
       "      <th>大专及以上</th>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>龙信集团</th>\n",
       "      <th>统招本科</th>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>80 rows × 1 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                    职称\n",
       "公司名称         edu      \n",
       "华为           统招本科   30\n",
       "             本科及以上  18\n",
       "科大讯飞         本科及以上  12\n",
       "海尔智家         本科及以上  10\n",
       "上海擎创信息技术有限公司 本科及以上   9\n",
       "...                 ..\n",
       "明略科技集团       统招本科    1\n",
       "朴新教育         统招本科    1\n",
       "柳工机械         统招本科    1\n",
       "江南布衣         大专及以上   1\n",
       "龙信集团         统招本科    1\n",
       "\n",
       "[80 rows x 1 columns]"
      ]
     },
     "execution_count": 28,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
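    "# C-5 Pandas basics\n",
    "\n",
    "# Number of distinct values in each column\n",
    "print (df_all.nunique())\n",
    "# Distinct education levels (not displayed: only a cell's last expression is shown)\n",
    "df_all[['edu']].drop_duplicates()\n",
    "\n",
    "# Number of postings per company and education level, largest first\n",
    "df_all.groupby(['公司名称','edu']).agg({\"职称\":\"count\"}).sort_values(by='职称', ascending=False)"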
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
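    "# This week's exercises\n",
    "\n",
    "Reverse-engineer the following in the same way:\n",
    "\n",
    "## params behind the upper part of the interface\n",
    "* Company: v (done)\n",
    "* Industry: ?\n",
    "* City: ?\n",
    "* Salary: ?\n",
    "## params behind the lower part of the interface\n",
    "* Jump to page N (confirm button): ?\n",
    "## Also try changing\n",
    "* keyword\n"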
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.4"
  },
  "toc": {
   "base_numbering": 1,
   "nav_menu": {},
   "number_sections": true,
   "sideBar": true,
   "skip_h1_title": false,
   "title_cell": "Table of Contents",
   "title_sidebar": "Contents",
   "toc_cell": false,
   "toc_position": {
    "height": "749px",
    "left": "1125.609375px",
    "top": "110px",
    "width": "281.390625px"
   },
   "toc_section_display": true,
   "toc_window_display": true
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
